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Chapter II procedure. If the application proceeds into the regional phase 
before the EPO, the applicant is reminded that a search may be carried 
out during examination before the EPO (see EPO Guideline C-VI, 8 5) 
should the problems which led to the Article 17(2) declaration be 
overcome. 
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modified with 2'-0-methyl and 2'-F-modifications (Cook, 1993, Antisense Research and 
Applications, CRC Press, 1 50- 1 8 1 ). 

Agrawai et al., US Patent No. 5,652,355, describe a phosphorothioate-containing 
nucleic acid molecule with at least two 2 '-O-methyl modifications on the 5' and 3' ends. 
5 Agrawai, US Patent No. 5,652,356, describes an oligonucleotide which consists of 

a region of 2'-0-substituted oligonucleotide located between two 
oligodeoxyribonucleotide regions. The DNA regions of this nucleic acid molecule 
consists of phosphorothioate modifications at every position. 

, Cook et al., US Patent No. 5,623,065, describe the use of a nucleic acid molecule 
10 which contains an RNase H cleavable region flanked by certain specifically modified 
nucleotides, for inhibition of gene expression of a ras gene. 

Cook et al., US Patent No. 5,587,362, describe a nucleic acid molecule having 
"substantially chirally pure inter-sugar linkages", for modulation of gene expression. 

Ohtsuka et al., US Patent No. 5,013;830, describe mixed oligomers having a DNA 
1 5 region and a 2'-0-methyl modified region, usefiil for modulation of gene expression. 

Waldcr et al., US Patent No. 5,491,133, describe a method for modulating gene 
expression using chimeric oligonucleotides with 3'-phosphodiester linkage modifications. 

Cohen et al., US Patent No. 5,276,019, and Cohen et al., US Patent No. 5,264,423 
describe the use of oligodeoxynucleotides of no more than 32 nucleotides in length, 
20 containing at least one phosphorothioate intemuclcoside linkage which are capable of 
preventing foreign nucleic acid replication. 

Cohen et al, US Patent No. 5,286,717, describe an oligodeoxyribonucleotide with 
at least one phosphorothioate modification capable of inhibiting oncogenes. 

Sproat et al., US Patent No. 5,334,7 11, describe 2'-0-R modified hammerhead and 
25 hairpin ribozymes, where R is alkyl. alkynyl or alkenyl. 

Crooke et al., 1996, Exp. Opin. Ther. Patents 6, 855, list and discuss various 
patents and PCT publications in the field of antisense technology. 

Sproat et al., US Patent No. 5,678,731, describe 2'-0-R modified oligonucleotides 
where R is alkyl, alkynyl or alkenyl. 
30 Usman et al., US Patent No. 5,652,094, describe enzymatic nucleic acid molecules 

which include nucleic acid analogues or deoxyribonucleotides. 

Joyce, International Publication No. WO 96/17086, describes a DNA en^me 
capable of cleaving RNA. 

Rossi et al., US Patent No. 5,144,019, describe chimeric hammerhead ribozymes 
35 with the binding arms and stem II region modified with deoxyribonucleotides. 
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Molecules have also been devised which include non-nucleotides capable of 
binding to nucleic acid. These peptide nucleic acid (PNA) molecules bind by Watson- 
Cnck base-pairing and may also function through an antisense mechanism. These 
molecules have been used to augment hammerhead ribozyme activity by altering the 
s^cture of target RNAs and increasing accessibility of cleavage sites (Jankowsky ./ 
1997, Nucleic Acids Research 25, 2690-2693). 

Summary of the Inventinn 

This invention relates to novel nucleic acid molecules which are useful for 
modulation of gene expression. The nucleic acid molecule of the instant invention are 
distinct fiom other nucleic acid molecules known in the art. Specifically, the nucleic acid 
molecules of the present invention have novel combuiations of chemical modifications and 
are capable of binding to RNA or DNA to facilitate modulation of gene expression. These 
novel combinations of chemical modifications may be used to form antisense 
oligonucleotides, triplex forming oligonucleotides. 2-5A antisense chimera, and enzymatic 
nucleic acid molecules. 

In a preferred embodiment, the invention features a nucleic acid molecule havmg 
the following formulae: 
Formula I: 




(X)ht-(Y)„— (X)„— C 3- 
(Y)n (X)ir-C" 3" 



5- C— (X)r (Y)n C 3- 

In a preferred embodiment, the invention features an enzymatic nucleic acid 
molecule having the formula: 
Formula IV: 

5" C (X)iir— (Z)p (X)o- (Y)„— (X) — c 3- 
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Formula V: 

5" C (X)iH-(Y)„-(X)o7(Z)p-(X)q— C 3- 

Formula VI: 

5- C (Y)„ (Z)p— (X)— C 3' 

5 Formula VII 

5- C (X)q (Z)p— (Y)„ C 3- 

In each of the above formula (I-VII), X represents independently a nucleotide 
which may be same or different; where m and o are integers independently greater than or 
equal to 4 and preferably less than about 100, more specifically 5, 6, 7, 8, 9, 10, 1 1, 12, 15, 

10 or 20; r is an integer greater than or equal to four, more specifically 5, 6, 7, 10, 15, or 20; 
the nucleic acid molecule may be symmetric (m = O) or asymmetric (m O); (X)^,, (X)o. 
and (X)q are oligonucleotides which are of sufiBcient length to stably interact 
independently with a target nucleic acid molecule (the target can be an RNA, DNA or 
RNA/DNA mixed polymers); Y represents independently a deoxyribonucleotide which 

15 may be same or different; n is an integer greater than or equal to 4, specifically 5, 6 7, 8, 9, 
10, 11, or 12; Z represents an oligonucleolide including nucleotides capable of facilitating 
the cleavage of a target sequence; p is of length greater than or equal to 4 but less than 
100, preferably 5, between 10-20, specifically 25-55, specifically between 30-45, more 
specifically 35-50; q is an integer greater than or equal to 0, preferably 1, 2, 3, 4, 5, 6, 7, 8, 

20 10, 15,20; represents a chemical linkage (e.g. a phosphate ester linkage, amide 

linkage or others known in the art); and each (X)^,, (X)o, (X)r, (X)q, and/or (Y)„ 
independently comprise phosphorothioate linkages, more specifically each (X)^, (X)o, 
(XX> (X)q, and/or (¥)„ independently comprise at least one phosphodiester linkage and one 
phosphorothioate linkage; each C and C independently represents a cap structure which 

25 may independently be present or absent; and (Z)p may optionally include a 
phosphorothioate linkage. The nucleotides in the each of the formula I-VII are 
unmodified or modified at the sugar, base, and/or phosphate as known in the art. 

Preferably, each of X represents independently a nucleotide \)v4uch may be same or 
different; where m and o are integers independently greater than or equal to 5; (X)^ and 

30 (X)o are oligonucleotides which are of sufficient length to stably interact independently 
with a target nucleic acid molecule; each (X)r comprises independently at least one 
phosphodiester linkage and one phosphorothioate linkage; Y represents independently a 
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deoxynbonucleotide which may be same or different; (Y)„ is an oligonucleotide which is 
of sufficient length to stably interact independently witii a target nucleic acid molecule- n 
IS an mteger greater than or equal to 4; each (X)., and (X)o comprise independenlly'at 
least one phosphodiester linkage and one phosphorothioate linkage; (Y)„ comprises a 
phosphorothioate linkage or a phosphorodithioate linkage or a 5'-S-phosphon,thioate or 
5'-S-phosphorodithioate, or a S'-S-phosphorothioate or a 3'-S-phosphorodithioate linl^e 
or a mixture thereof; and each C and C independently represents a cap structure which 
may independently be present or absent. 

By "nucleotide" as used herein is as recognized in the art to include natural bases 
(standard), and modified bases well known in the art. Such bases are generally located at 
the r position of a sugar moiety. Nucleotide generally comprise a base, sugar and a 
phosphate group. The nucleotides can be unmodified or modified at the sugar phosphate 
and/or base moiety, (also referred to interchangeably as nucleotide analogs, modified 
nucleotides, non-natural nucleotides, non-standard nucleotides and other; see for example 
Usman and McSwiggen. suprcr, Eckstein et al. International PCX Publication No WO 
92/07065; Usman et al. International PCT Publication No. WO 93/15187- Uhlman & 
Peyman, supra) all are hereby incoiporated by reference herein). There are several 
examples of modified nucleic acid bases known in the art and has recently been 
summarized by Limbach et al, 1994, Nucleic Acids Res. 22. 2183. Some of the non- 
limiting examples of base modifications that can be introduced into nucleic acids include 
mosme, purine, pyridin-4-one, pyridin-2-one, phenyl, pseudouracil, 2, 4, 6-trimethoxy 
benzene, 3-methyl uracil, dihydrouridine, naphthyl, aminophenyl, 5-alkylcytidines (eg. 
5-methyIcytidine), 5-alkyluridines (e.g., ribothymidine), 5-haIouridine ie g. 
5-bromoundine) or 6-a2apyrimidines or 6-alkyIpyrimidines {e.g. 6-methyl uridine)' 
propyne. and others (Burgin et d., 1996, Biochemistry, 35, 14090; Uhlman & Peyman' 
supra\ By "modified bases" in this aspect is meant nucleotide bases other than adenine' 
guamne, cytosine and uracil at 1' position or their equivalents; such bases may be used ai 
any position, for example, within the catalytic core of an enzymatic nucleic acid molecule 
and/or in the substrate-binding regions of the nucleic acid molecule. 

By "unmodified nucleotide" is meant a nucleotide with one of the bases adenine 
cytosine, guanine, thymine, uracil joined to the 1' carbon of P-D-ribo-furanose. 

By "modified nucleotide" is meant a nucleotide which contains a modification in 
the chemical structure of an umnodified nucleotide base, sugar and/or phosphate. 

By "sufficient length" is meant an oligonucleotide of greater than or equal to 4 
nucleotides. 
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By "stably interact" is meant, interaction of the oligonucleotides with target 
nucleic acid (e.g., by forming hydrogen bonds with complementary nucleotides in the 
target under physiological conditions). The interaction is stable either alone or in 
conjunction with (Y)„ and (Z)p where applicable. ; 
5 By "chimeric nucleic acid molecule" or "chimeric oligonucleotide" is meant that, 

the molecule may be comprised of both modified or unmodified DNA or RNA. 

By "cap structure" is meant chemical modifications which have been incorporated 
at the terminus of the oligonucleotide. These terminal modifications protect the nucleic 
acid molecule from exonuclease degradation, and may help in delivery and/or localization 
10 within a cell. 

In another preferred embodiment (X)^, (X)o, (X)q, (Y)„ and/or (Z)p independently 
include modifications selected from a group comprising 2'-0-alkyl {e.g. 2'-(3-allyl; Sproat 
et ai, supra); 2'-0-alkylthioalkyl (e.g. 2'-0-methylthiomethyl; Karpeisky el al, 1998, 
Nucleosides & Nucleotides 16, 955-958); L-nucleotides (Tazawa et al., 1970, 

15 Biochemistry 3499; Ashley, 1992, J. Am. Chem. Soc. 114, 9731; Klubmann el al., 1996, 
Nature Biotech 14, 1112); 2'-C-alkyl (Beigehnan et ai, 1995,7. Biol. Chem. 270, 25702); 
1-5-Anhydrohexitol; 2,6-diaminopurine (Strobel et ai, 1994, Biochem. 33, 13824-13835); 
2'-(Af-alanyl) amino-2'-deoxynucleotide; 2'-(Ar-beta-alanyl) amino; 2'.deoxy-2'-(lysyl) 
amino; 2'-0-amino (Karpeisky et al., 1995, Tetrahedron Lett. 39, 1131); 2'-deoxy-2'-(A^- 

20 histidyl) amino; 5-metiiyl (Strobel, supra); 2'-(A'^-b-carboxamidine-beta-aIanyl) amino; 2'- 
deoxy-2'-(N-beta-alanyl) (Mamlic-Adamic et al., 1995, Bioorg. & Med Chem. Lett. 
5,2721-2724); xylofuranosyl (Rosemeyer et ai, 1991, Helvetica Chem. Acta, 74, 748; 
Seela et al., 1994, Helvetica Chem. Acta, 77, 883; Seela et al., 1996, Helvetica Chem. 
Acta. 79, 1451). 

25 In a preferred embodiment, the invention features a nucleic acid molecule of any of 

formula I-VII, where each X and/or Z, independently include a nucleotide modification 
having formula IX: 

D 



0 OR, 

I 

E 
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15 



20 



25 



Where, each B is independently a modified or an unmodified nucleic acid base; Rl 
is independently a fluoroalkyl or an alkylthiofluoroaUcyl; E is independently a phosphorus- 
containing group; and D is independently an O, blocking group or a phosphorus- 
containing group. 

In another preferred embodiment, the invention feamres a nucleic acid molecule of 
any of formula I-VII, where each X and/or Z, independently include a nucleotide 
modification havuig formula X: 



D 



Ov 




OR, 

X 



Wherein, each B is independently a modified or an unmodified nucleic acid base; 
Rl is independently an aklyl, an alkylthioalkyl, a fluoroalkyl or an alkylthiofluoroalkyl; E 
is mdependently a phosphorus-contaming group; and D is independently an O, blocking 
group or a phosphorus-containing group. 

An "alkyl" group refers to a saturated aliphatic hydrocarbon, including straight- 
chain, branched-chain, and cyclic alkyl groups. Preferably, the alkyl group has 1 to 12 
carbons. More preferably it is a lower alkyl of firom 1 to 7 carbons, more preferably 1 to 4 
carbons. The alkyl group may be substituted or unsubstituted. When substituted the 
substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, NO2 or N(CH3)2, 
amino, or SH. The term also includes alkenyl groups which are unsaturated hydrocarbon 
groups containing at least one carbon-carbon double bond, including straight-chain, 
branched-chain, and cyclic groups. Preferably, the alkenyl group has 1 to 12 carbons! 
More preferably it is a lower alkenyl of fi-om 1 to 7 carbons, more preferably 1 to 4 
carbons. The alkenyl group may be substituted or unsubstituted. When substituted the 
substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, NO2, halogen, 
N(CH3)2, amino, or SH. The term "alkyl" also includes alkynyl groups which have arl 
unsaturated hydrocarbon group containing at least one carbon-carbon triple bond, 
including straight-chain, branched-chain, and cyclic groups. Preferably, the alkynyl group 
has 1 to 12 carbons. More preferably h is a lower alkynyl of from 1 to 7 carbons, more 
preferably I to 4 carbons. The alkynyl group may be substituted or unsubstituted. When 
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substituted the substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, NO2 
or N(CH3)2, amino or SH. 

Such alkyl groups may also include aryl, alkylaryl, carbocyclic aryl, heterocyclic 
aryl, amide and ester groups. An "aryl" group refers to an aromatic group which has at 
5 least one ring having a conjugated p electron system and includes carbocyclic aryl, 
heterocyclic aryl and biaryl groups, all of which- may be optionally substituted. The 
preferred substituent(s) of aryl groups are halogen, trihalomethyl, hydroxyl, SH, OH, 
cyano, alkoxy, alkyl, alkenyl, alkynyl, and amino groups. An "alkylaryl" group refers to 
an alkyl group (as described above) covalently joined to an aryl group (as described above. 
10 Carbocyclic aryl groups are groups wherein the ring atoms on the aromatic ring are all 
carbon atoms. The carbon atoms are optionally substituted. Heterocyclic aryl groups are 
groups having fixjm 1 to 3 heteroatoms as ring atoms in the aromatic ring and the 
remainder of the ring atoms are carbon atoms. Suitable heteroatoms include oxygen, 
sulfur, and nitrogen, and include furanyl, Ihienyl, pyridyl, pyrrolyl, N-lower alkyl pyirolo, 
15 pyrimidyl, pyrazinyl, imidazolyl and the like, all optionally substituted. An "amide" refers 
to an -C(0)-NH-R, where R is either alkyl, aryl, alkylaryl or hydrogen. An "ester" refers 
to an -C(0)-OR', where R is either alkyl, aryl, alkylaryl or hydrogen. 

A "blocking group" is a group which is able to be removed after polynucleotide 
synthesis and/or which is compatible with solid phase polynucleotide synthesis. 
20 A "phosphorus containing group" can include phosphorus in forms such as 

dithioates, phosphoramidites and/or as part of an oligonucleotide. 

In yet another preferred embodiment C is selected from a group comprising ' 
inverted abasic residue,. 4',5'-methylene nucleotide; I-(beta-D-erythrofiiranosyl) 
nucleotide, 4'-thio nucleotide, cari)ocyclic nucleotide; 1,5-anhydrohexitol nucleotide; L- 
25 nucleotides; alpha-nucleotides; modified base nucleotide; phosphorodithioate linkage; 
fWo-pentofiiranosyl nucleotide; acyclic 3',4'-seco hucleotide; acyclic 3,4-dihydroxybutyl 
nucleotide; acyclic 3,5-dihydroxypentyl nucleotide, 3 '-3'-inverted nucleotide moiety; 3'-3'- 
inverted abasic moiety; 3'-2'-inverted nucleotide moiety; 3'-2'-inverted abasic moiety; 1,4- 
butanediol phosphate; 3'-phosphoramidate; hexylpho^hate; aminohexyl phosphate; 3'- 
30 phosphate; 3'-phosphorothioate; phosphorodithioate; or bridging or non-bridging 
methylphosphonate moiety (for more details see-Beigelman et al. International PCT 
publication No. WO 97/26270, incorporated by refefence herein). 

In yet another preferred embodiment C is selected from a group comprising, 4',5'- 
methylene nucleotide; l-(beta-D-erythrofiiranosyl) nucleotide; 4'-thio nucleotide, 
35 carbocyclic nucleotide; 5'-amino-alkyl phosphate; 1 ,3-diamino-2-propyl phosphate, 3- 
aminopropyl phosphate; 6-aminohexyl phosphate; 1 ,2-aminododccyl phosphate; 
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hydroxypropyl phosphate; 1,5-anhydrohcxitol nucleotide; L-nucleotide; alpha-nucleotide; 
modified base nucleotide; phosphorodithioate; /Wpentofuranosyl nucleotide; acyclic 
3',4'-seco nucleotide; 3,4-dihydroxybutyl nucleotide; 3,5-dihydroxypentyl nucleotide, 5'- 
5'-inverted nucleotide moeity; 5'-5'-inverted abasic moeity; 5'-phosphoramidate;' 5'- 
phosphorothioate; l,4.butanediol phosphate; S'-amino; bridging and/or non-bridging 5'- 
phosphoramidate, phosphorothioate and/or phosphorodithioate, bridging or non bridging 
metiiylphosphonate and 5'-mercapto moeities (for more details see Beaucage and Iyer, 
1993, Tetrahedron 49, 1925; incorporated by reference herein). 

In another preferred embodiment (Z)p includes a non-nucleotide linker. Thus, in a 
preferred embodiment, the invention feahires an enzymatic nucleic acid molecule having 
one or more non-nucleotide moieties, and having enzymatic activity to cleave an RNA or 
DNA molecule. By tiie term "non-nucleotide" is meant any group or compound which can 
be incoiporated into a nucleic acid chain in the place of one or more nucleotide units, 
including either sugar and/or phosphate substitutions, and allows the remaining bases to 
exhibit tiieir enzymatic activity. The group or compound is abasic in that it does not 
contain a commonly recognized nucleotide base, such as adenosine, guanine, cytosme, 
uracil or thymine. The terms "abasic" or "abasic nucleotide" as used herein encompass 
sugar moieties lacking a base or having other chemical groups in place of base at tiie 1' 
position. 

By the phrase "enzymatic nucleic acid" is meant a nucleic acid molecule capable of 
catalyzing (altermg tiie velocity and/or rate of) a variety of reactions including the ability 
to repeatedly cleave otiier separate nucleic acid molecules (endonuclease activity) in a 
nucleotide base sequence-specific manner. Such a molecule witii endonuclease activity 
may have complementarity in a substrate bindmg region {e.g. (X)^ (X)o, (X), and (¥)„ in 
formulae IV-VII) to a specified gene target, and also has an enzymatic activity that 
specifically cleaves RNA or DNA in tiiat target. That is, the nucleic acid molecule with 
endonuclease activity is able to intramolecularly or intermolecularly cleave RNA or DNA 
and thereby inactivate a target RNA or DNA molecule. This complementarity functions to 
allow sufficient hybridization of the enzymatic RNA molecule to the target RNA or DNA 
to allow the cleavage to occur. 100% complementarity is preferred, but complementarity 
as low as 50-75% may also be useful in this invention. The nucleic acids may be modified 
at tiie base, sugar, and/or phosphate groups. The term enzymatic nucleic acid is used 
interchangeably with phrases such as ribozymes, catalytic RNA, enzymatic RNA, catalytic 
DNA. nucleozyme, DNAzyme, RNA enzyme, endoribonuclease, endonuclease, 
minizyme, leadzyme, oligozyme, chimeric ribozyme, chimeric enzymatic nucleic acid, or 
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DNA enzyme. All of these terminologies describe nucleic acid molecules with enzymatic 
activity. 

By "complementarity" is meant a nucleic acid that can form hydrogen bond(s) with 
other RNA sequence by either traditional Watson-Grick or other non-traditional types (for 
5 example, Hoogsteen type) of base-paired interactions. 

By "oligonucleotide" as used herein, is meant a molecule comprising two or more 
nucleotides. 

By "enzymatic portion" is meant that part of the enzymatic nucleic acid molecule 
essential for cleavage of a nucleic acid substrate 

10 By "substrate binding region" or "substrate bindmg domain" is meant that 

portion/region of a nucleic acid molecule (e.g. ribozyme) which is complementary to (/.e, 
able to base-pair with) a portion of its substrate. Generally, such complementarity is 
100%, but can be less if desired. For example, as few as 10 bases out of 14 may be base- 
paired. Such arms are shown generally in Figures 1 and 3. That is, in a ribozyme example, 

15 these arms contain sequences within a ribozyme which are intended to bring ribozyme and 
target RNA together through complementary base-pairing interactions. The ribozyme of 
the invention may have bindmg arms that are contiguous or non-^ntiguous and may be of 
varying lengths. The length of the binding arm(s) are preferably greater than or equal to 
four nucleotides; specifically 12-100 nucleotides; more specifically 14-24 nucleotides 

20 long. If two binding arms are chosen, the design is such that the length of the binding 
arms are symmetrical (i.e., each of the binding arms is of the same length; e.g„ five and 
five nucleotides, six and six nucleotides or seven and seven nucleotides long) or 
asynunetrical (/.e.. the binding arms are of different length; e.g., six and three nucleotides; 
three and six nucleotides long; four and five nucleotides long; four and six nucleotides 

25 long; four and seven nucleotides long; and the like). 

By "DNAzyme" or "catalytic DNA" or "DNA enzyme" is meant, an enzymatic 
nucleic acid molecule lacking a 2*-0H group. 

By "nucleic acid molecule" as used herein is meant a molecule comprising 
nucleotides. The nucleic acid can be composed of modified or unmodified nucleotides or 

30 non-nucleotides or various mixtures and combinations thereof 

In another preferred embodiment, the nucleic acid molecule of the present 
invention is conjugated with another moiety including but not limited to abasic 
nucleotides, polyether, polyamine, polyamides, peptides, carbohydrates, lipid, or 
polyhydrocarbon compounds. Those skilled in the art will recognize that these molecules 

35 may be linked to one or more of any nucleotides comprising the nucleic acid molecule at 
several positions on the sugar, base or phosphate group. 
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In yet another preferred embodiment, the nucleic acid molecule of the present 
invention can form stmctures including but not limited to antisense. triplexes, 2-5A 
chimera antisense, or enzymatic nucleic acid (ribozymes). 

By "antisense" is meant a non-enzymatic nucleic acid molecule that binds to target 
RNA, for example, by means of RNA-RNA or RNA-DNA or RNA-PNA (protein nucleic 
acid; Egholm et al, 1993 Nature 365, 566) interactions and alters the activity of the target 
RNA (for a review see Stein and Cheng, 1993 Science 261 , 1004). 

By ''2-5A antisense chimera" it is meant, an antisense oligonucleotide containing a 
5' phosphorylated 2'.5'-linked adenylate residues. These chimeras bind to target RNA in a 
sequence-specific manner and activate a cellular 2-5A-dependent ribonuclease which in 
turn cleaves the target RNA (Torrence etal, 1993 Proc. Natl Acad. Sci. USA 90, 1300). 

By "triplex DNA" it is meant an oligonucleotide that can bind to a double-stranded 
DNA in a sequence-specific manner to form a triple-strand helix. Triple-helix formation 
has been shown to inhibit transcription of the targeted gene (Duval-Valentin et al. mi 
Proc. Natl Acad. Sci. USA 89, 504). 

In another preferred embodiment, the invention features an antisense 
oligonucleotide which is capable of interacting with the target RNA and sterically 
blocking translation, where the oligonucleotide has a 5' and a 3' Cap structure and the 
oligonucleotide may include modifications at the base, sugar or the phosphate groups. 

Other features and advantages of the invention will be apparent from the following 
description of the preferred embodiments thereof, and from the claims. 

Description of the Prefer red Emhodimenfs 

The drawings will first briefly be described. 

Drawings: 

Figure 1 is a diagrammatic representation of a nucleic acid molecule with 7-9 
phosphorothioate oligodeoxyribonucleotide sequence flanked by 9 non- 
deoxyribonucleotide containing oligonucleotides, binding to a target molecule. 

Figure 2 displays schematic representations of certain chemical structures which 
may be incorporated into the nucleic acid molecule of the invention. 

Figure 3 shows the secondary structure model for seven different classes of 

enzymatic nucleic acid molecules. Arrow indicates the site of cleavage. indicate 

the target sequence. Lines interspersed with dots are meant to indicate tertiary interactions. 
- is meant to indicate base-paired interaction. Group I Intr n: P1-P9.0 represent various 
stem-loop structures (Cech etal, 1994, Nature Struc. Bio., 1, 273). RNase P (MIRNA): 
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EGS represents external guide sequence (Forster et al, 1990, Science, 249, 783; Pace et 
a/., 1990, 1 Biol Chem., 265, 3587). Group II Intron: 5'SS means 5' splice site; 3' SS 
means 3 '-splice site; IBS means intron binding site; EBS means exon binding site (Pyle et 
ai, 1994, Biochemistry, 33, 2716). VS RNA: I-VI are meant to indicate six stem-loop 
5 structures; shaded regions are meant to indicate tertiary interaction (Collins, International 
PCX Publication No. WO 96/19577). HDV Ribozyme: : I-IV are meant to indicate four 
stem-loop structures (Been et al, US Patent No. 5,625,047). Hammerhead Ribozyme: : 
I-III are meant to indicate three stem-loop structures; stems WII can be of any length and 
may be symmetrical or asymmetrical (Usman et al, 1996, Curr Op. Struct, Bio,, 1, 527). 

10 Hairpin Ribozyme: Helix 1, 4 and 5 can be of any length; Hclbc 2 is between 3 and 8 
base-pairs long; Y is a pyrimidine; Helix 2 (H2) is provided with a least 4 base pairs (i.e., 
n is 1, 2, 3 or 4) and helix 5 can be optionally provided of length 2 or more bases 
(preferably 3 - 20 bases, i.e., m is from 1 - 20 or more). Helix 2 and helix 5 may be 
covalently linked by one or more bases (i.e., r is 1 base). Helix 1, 4 or 5 may also be 

15 extended by 2 or more base pairs {e.g., 4 - 20 base pairs) to stabilize the ribozyme 
structure, and preferably is a protein binding site. In each instance, each N and N' 
independently is any normal or modified base and each dash represents a potential base- 
pairing interaction. These nucleotides may be modified at the sugar, base or phosphate. 
Complete base-pairing is not required in the helices, but is preferred. Helix 1 and 4 can be 

20 of any size (i.e., o and p is each independently from 0 to any number, e.g., 20) as long as 
some base-pairing is maintained. Essential bases are shown as specific bases in the 
structure, but those in the art will recognize that one or more may be modified chemically 
(abasic, base, sugar and/or phosphate modifications) or replaced with another base without 
significant effect. Helix 4 can be formed fi-om two separate molecules, i.e., without a 

25 connecting loop. The connecting loop when present may be a ribonucleotide with or 
wittiout modifications to its base, sugar or phosphate, "q" is 2 bases. The connecting 
loop can also be replaced with a non-nuclcotide linker molecule. H refers to bases A, U, 

or C. Y refers to pyrimidine bases. " " refers to a covalent bond. (Burke et al, 1996, 

Nucleic Acids & Mol Biol, 10, 129; Chowrira et ah US Patent No. 5,631,359). 

30 Figure 4 is a graph comparing cell proliferation rates of MCF-7 cells treatment 

with active and inactive ribozyme with mismatch arms targeted to estrogen receptor 
delivered with GSV transfection reagent. The sequence for the active ribozyme is given as 
Seq. ID. No.2725 and the inactive is 2726. 

Figure 5 is a graph comparing RNA levels of c-raf RNA in PC-3 cells following 

35 treatment with antisense (Seq. ID. No 27 1 7) and scrambled antisense (mismatch) controls. 
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Figure 6a and 6b displays several possible ribozymes comprising 

ohgodeoxyribonucleotides. The symbols used in the diagram include: N' represents a 

nucleotide complementary to a nucleotide on the target molecule; N7 represents position 7 

in the ribozyme molecule (Hertel, K. J., et ai. 1992, Nucleic Acid, R,. 20, 3252); N' 

represents a deoxyribonucleotide complementary to a nucleotide on the target molecule; s 

represents a phosphorothioate modification; C represents a chemical modification at the 5' 

end of the ribozyme; and C represents a chemical modification at the 3' end. 

Figure 7 shows examples of nucleotide modifications for incorporation into 
oligonucleotides. 

Figure 8 is a graph demonstrating the level of c-raf mRNA in PC-3 cells following 
treatment with c-raf sequence tai:geting antisense nucleic acid molecules. ITie antisense 
molecules target regions within the intron/exon junction (Seq. I.D. Nos.273 1-2735). intron 
(Seq. I.D. 2736) and exon (Seq. I.D. 2737). Tlie results demonstrate the molecules'ability 
to inhibit c-raf mRNA compared to the untreated or mismatch control at two 
concentrations. 

Figure 9 is a graph which demonstrates the ability of an antisense molecule 
represented by Seq. I.D. No. 2738 (phosphorothioate modifications at every 
deoxynucleotide position) to inhibit c-raf message in PC-3 cells over a period of five days 
compared to the untreated and mismatch controls at three different concentrations. 

Figure 10 is a graph which demonstrates the ability of an antisense molecule 
represented by Seq. I.D. No. 2737 (tiiree phosphorothoate modified nucleotides at both the 
5' and 3' ends of tiie oligonucleotide) to inhibit c-raf message in PC-3 cells over a period 
of five days compared to tiie untreated control at three different concentrations. 

Figure 11 is a graph which demonstrates tiie ability of an antisense molecule 
represented by Seq. I.D. No. 2744 (9 phosphorothioate modified DNA fianked by 7 2'-0- 
metiiylthiometiiyl RNA nucleotides at the 5' and 3' end) to inhibit c-raf message in PC-3 
cells over a period of five days compared to the untieated and mismatch control at tiiree . 
different concentrations. 

Figure 12 is a graph showing the level of cellular proliferation inhibition exhibited 
by antisense oligonucleotides represented by Seq. I.D. Nos. 2738 and 2741. Also shown . 
witiiin tiie graph is the cellular proliferation after treatment of cells witii a mismatch 
control (Seq. I.D. 2739). 

Figure 13 is a graph showing the ability of oligonucleotides with different 
chemical modifications to inhibit c-raf mRNA. Each antisense molecule is compared to 
an untreated and mismatch control. 
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Figure 14 is a graph demonstrating a dose dependent inhibition of C-raf in PC-3 
cells following treatment with antisense oligonucleotides (Seq. I.D. Nos. 2741 and 2738). 

Figure 15 is a graph showing inhibition of bcl-2 mRNA by antisense 
oligonucleotides compared to untreated and mismatch controls. 
5 Figure 16 is a graph showing the ability of several k-ras targeting antisense 

molecules to inhibit k-ras message 

Figure 17 is a graph showing the ability of oligonucleotides with different 
chemical modifications to inhibit estrogen receptor mRNA. Each antisense molecule is 
compared to an untreated and mismatch control. 
10 Figure 18 shows a scheme for the synthesis of 3'-deoxy-3'-thio guanosine 

nucleoside (scheme I). 

Figure 19 shows a scheme for the synthesis of S-(pyridyl-2-disuIfanyl) derivative 
(scheme 2). 

Figure 20 shows a scheme for the synthesis of 3'-deoxy.3'-thio guanosine 
15 phosphoramidite. 

Figure 21 shows a scheme for the preparation of 5'-thio-nucleoside 
phosphoramidite and succinates. 

Figure 22 shows a scheme for the synthesis of 3 '-thio-2'-0-methyl uridine. 

Synthesis of Nucleic acid Molecules 

20 Synthesis of nucleic acids greater than 100 nucleotides in length is difficult using: 

automated methods, and the therapeutic cost of such molecules is prohibitive. In this 
invention, small nucleic acid motifs (e.g., antisense oligonucleotides, hammerhead or the- 
hairpin ribozymes) are used for exogenous delivery. The simple structure of these 
molecules increases the ability of the nucleic acid to invade targeted regions of RNA 

25 structure. The molecules of the instant invention were chemically synthesized. 
Oligodeoxyribonucleotides were synthesized using standard protocols as described in 
Caruthers et al, 1992, Methods in Enzymology 21 1,3-19, and is incorporated by reference. 

The method of synthesis used for normal RNA including certain enzymatic nucleic 
acid molecules follows the procedure as described in Usman et al, 1987 J, Am, Chem. 

30 Soc, 109, 7845; Scaringe et al, 1990 Nucleic Acids Res., 18, 5433; and Wincott et al„ 
1995 Nucleic Acids Res. 23, 2677-2684 and makes use of common nucleic acid protecting 
and coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'- 
end. In a non-limiting example, small scale synthesis were conducted on a 394 Applied 
Biosystems, Inc. synthesizer using a modified 2.5 jxmol scale protocol with a 5 min 

35 coupling step for alkylsilyl protected nucleotides and 2.5 min coupling step for T-O- 



wo 99/54459 



PCT/US99/08547 



16 

methylated nucleotides. Table II outlines the amounts, and the contact times, of the 
reagents used in the synthesis cycle. A 6.5-fold excess (163 of 0.1 M = 16.3 ^mol) of 
phosphoramidite and a 24-fold excess of S'-ethyl tetrazole (238 nL of 0.25 M = 59.5 ^moI) 
relative to polymer-bound 5'-hydroxyl was used in each coupling cycle. Average coupling 
5 yields on the 394 Applied Biosystems, Inc. synthesizer, determined by colorimetric 
quantitation of the trityl fractions, were 97.5-99%. Other oligonucleotide synthesis 
reagents for the 394 Applied Biosystems, Inc. synthesizer; detritylation solution was 2% 
TCA in methylene chloride (ABI); capping was performed with 16% A^-methyl imidazole 
in THF (ABI) and 10% acetic anhydride/10% 2,6-lutidine in THF (ABI); oxidation 
10 solution was 16.9 mM I2, 49 mM pyridine, 9% water in THF (Millipore). B & J Synthesis 
Grade acetonitrile was used directly from the reagent bottle. 5-EthyI tetrazole solution 
(0.25 M in acetonitrile) was made up from the solid obtained from American International 
Chemical, Inc. 

Deprotection of the RNA was perfonned as follows. The polymer-bound 
15 oligoribonucleotide, trityl-off, was transferred from the synthesis column to a 4mL glass 
screw top vial and suspended in a solution of methylamine (MA) at 65 °C for 10 min. 
After cooling to -20 °C, the supernatant was removed from the polymer support. The 
support was washed three times with 1.0 mL of EtOH:MeCN:H20/3:l:l, vortexed and the 
supernatant was then added to the first supernatant. The combined supematants, 
20 containing the oligoribonucleotide, were dried to a white powder. 

The base-deprotected oligoribonucleotide was resuspended in anhydrous 
TEA-HF/NMP solution (250 fiL of a solution of 1.5mL iV-methylpyrrolidinone, 750 \xL 
TEA and 1.0 mL TEA-3HF to provide a 1.4M HP concentration) and heated to 65°C for 
1.5 h. The resulting, fully deprotected, oligomer was quenched with 50 mM TEAB (9 
25 mL) prior to anion exchange desalting. 

For anion exchange desalting of the deprotected oligomer, the TEAB solution was 
loaded onto a Qiagen 500® anion exchange cartridge (Qiagen Inc.) that was prewashed 
with 50 mM TEAB (10 mL). After washing the loaded cartridge with 50 mM TEAB (10 
mL), the RNA was eluted with 2 M TEAB (10 mL) and dried down to a white powder. 
30 Inactive hammerhead ribozymcs were synthesized by substituting a U for G5 and a 

U for A 14 (numbering from Hertel, K. J., et ai, 1992, Nucleic Acids Res. . 20, 3252). 

The average stepwise coupling yields were >98% (Wincott et ai, 1995 Nucleic 
Acids Res. 23, 2677-2684). 

Alternatively, the nucleic acid molecules of the present invention can be 
35 synthesized separately and joined together by ligation (Moore et at., 1992, Science 256, 
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9923; Draper et al. International PCT publication No. WO 93/23569; Shabarova et al, 
1 991 , Nucleic Acids Research 1 9, 4247) 

Administration of Nucleic Acid Molecules 

Methods for the delivery of nucleic acid molecules is described in Akhtar et ai, 
5 1992, Trends Cell Bio,, 2, 139; and Delivery Strategies for Antisense Oligonucleotide 
Therapeutics, ed. Akhtar, 1995 which are both incorporated herein by reference. Sullivan 
et al , PCX WO 94/02595, fiirther describes the general methods for delivery of enzymatic 
RNA molecules . These protocols may be utilized for the delivery of virtually any nucleic 
acid molecule. Nucleic acid molecules may be administered to cells by a variety of 

10 methods known to those familiar to the art, including, but not restricted to, encapsulation 
in liposomes, by iontophoresis, or by incorporation into other vehicles, such as hydrogels, 
cyclodextrins, biodegradable nanocapsules, and bioadhesive microspheres. For some 
indications, nucleic acid molecules may be directly delivered ex vivo to cells or tissues 
with or without the aforementioned vehicles. Alternatively, the nucleic acid/vehicle 

15 combination is locally delivered by direct injection or by use of a catheter, infusion pump 
or stent. Other routes of delivery include, but are not limited to, intravascular, 
intramuscular, subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), : 
topical, systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed 
descriptions of nucleic acid delivery and administration are provided in Sullivan et al- 

20 supra and Draper et al, PCT W093/23569 which have been incorporated by reference: 
herein. . 

The molecules of the instant invention can be used as pharmaceutical agentsk 
Pharmaceutical agents prevent, inhibit the occurrence, or treat (alleviate a symptom to 
some extent, preferably all of the symptoms) of a disease state in a patient. 

25 The negatively charged polynucleotides of the invention can be administered (e.g., 

RNA, DNA or protein) and introduced into a patient by any standard means, with or 
without stabilizers, buffers, and the like, to form a pharmaceutical composition. When it 
is desired to use a liposome delivery mechanism, standard protocols for formation of 
liposomes can be followed. The compositions of the present invention may also be 

30 formulated and used as tablets, capsules or elixirs for oral administration; suppositories for 
rectal administration; sterile solutions; suspensions for injectable administration; and the 
like. 

The present invention also includes pharmaceutically acceptable formulations of 
the compounds described. These formulations include salts of the above compounds, eg., 
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acid addition salts, for example, salts of hydrochloric, hydiobromic, acetic acid, and 
benzene sulfonic acid. 

A pharmacological composition or formulation refers to a composition or 
fomiulation in a form suitable for administration, e.g.. systemic administration, into a cell 
or patient, preferably a human. Suitable fonns, in part, depend upon the use or the route of 
entry, for example oral, transdermal, or by injection. Such foms should not prevent the 
composition or formulation to reach a target cell {i.e., a cell to which the negatively 
charged polymer is desired to be delivered to). For example, pharmacological 
compositions injected into the blood stream should be soluble. Other factors are known in 
the art. and include considerations such as toxicity and forms which prevent the 
composition or formulation fix)m exerting its effect. 

By "systemic administration" is meant in vivo systemic absorption or 
accumulation of drugs in the blood stream followed by distribution throughout the enUre 
body. Administration routes which lead to systemic absorption include, without 
limitations: intravenous, subcutaneous, intraperitoneal, inhalation, oral, intrapulmonary 
and mtramuscular. Each of these administration routes expose the desired negatively 
charged polymers, e.g.. nucleic acids, to an accessible diseased tissue. The rate of entry 
of a drug into the circulation has been shown to be a function of molecular weight or size. 
The use of a liposome or other drug carrier comprising the compounds of the instant 
mvention can potentially locaUze the drug, for example, in certain tissue types, such as the 
tissues of the reticular endothelial system (RES). A liposome formulation which can 
facihtate the association of drug with the surface of cells, such as, lymphocytes and 
macrophages is also useful. This approach may provide enhanced delivery of the drug to 
target cells by taking advantage of the specificity of macrophage and lymphocyte immune 
25 recognition of abnormal cells, such as the cancer cells. 

The invention also features the use of the a composition comprising smface- 
modified liposomes containing poly (ethylene glycol) lipids (PEG-modified, or long- 
circulating liposomes or stealth liposomes). These formulations offer an method for 
increasing the accumulation of drugs in target tissues. This class of drug carriers resists 
opsonization and elimination by the mononuclear phagocytic system (MPS or RES) 
thereby enabling longer blood circulation times and enhanced tissue exposure for the 
encapsulated drug (Lasic et al. Chem. Rev. 1995, 95, 2601-2627; Ishiwata et al Chem 
Pharm. Bull 1995, 43, 1005-1011). Such liposomes have been shown to accumulate 
selectively in tumors, presmnably by extravasation and captiire in the neovascularized 
target tissues (Lasic et al.. Science 1995, 267, 1275-1276; Oku et al 1995 Biochim 
Biophys. Ada, 1238, 86-90). The long-circulating liposomes enh'ance the 
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pharmacokinetics and pharmacodynamics of DNA and RNA, particularly compared to 
conventional cationic liposomes which are known to accumulate in tissues of the MPS 
(Liu et a/,, J. Biol Chem. 1995, 42, 24864-24870; Choi et al, International PCT 
Publication No. WO 96/10391; Ansell et ai, International PCT Publication No. WO 
96/10390; Holland et al. International PCT Publication No. WO 96/10392; all of these 
are incorporated by reference herein). Long-circulating liposomes are also likely to 
protect drugs from nuclease degradation to a greater extent compared to cationic 
liposomes, based on their ability to avoid accumulation in metabolically aggressive MPS 
tissues such as the liver and spleen. All of these references are incorporated by reference 
herein. 

The present invention also includes compositions prepared for storsfge or 
administration \^ch include a pharmaceutically effective amount of the desired 
compounds in a pharmaceutically acceptable carrier or diluent. Acceptable carriers or 
diluents for therapeutic use are well known in the pharmaceutical art, and are described, 
for example, in Remington's Pharmaceutical Sciences, Mack Publishing Co. (A.R. 
Gennaro edit. 1985) hereby incorporated by reference herein. For example, preservatives, 
stabilizers, dyes and flavoring agents may be provided. Id at 1449. These include sodium 
benzoate, sorbic acid and esters of />-hydroxybenzoic acid. In addition, antioxidants and 
suspending agents may be used. Jd 

A pharmaceutically effective dose is that dose required to prevent, inhibit the 
occurrence, or treat (alleviate a symptom to some extent, preferably all of the symptoms) 
of a disease state. The pharmaceutically effective dose depends on the type of disease, the 
composition used, the route of administration, the type of mammal being treated, the - 
physical characteristics of the specific mammal under consideration, concurrent 
medication, and other factors which those skilled in the medical arts will recognize. 
Generally, an amount between 0.1 mg/kg and 100 mg/kg body weight/day of active 
ingredients is administered dependent upon potency of the negatively charged polymer. 

Mechanism of action of Nucleic Acid Molecules of the Invention 

Antisense : Antisense molecules may be RNA or DNA oligonucleotides and 
primarily function by specifically binding to matching sequences resulting in inhibition of 
peptide syndiesis (Wu-Pong, Nov 1994, BioPharm, 20-33). The oligonucleotide binds to 
target RNA by Watson Crick base-pairing and blocks gene expression by preventing 
ribosomal translation of the bound sequences. Antisense molecules may also alter protein 
synthesis by interfering with RNA processing or transport from the nucleus into the 
cytoplasm (Mukhopadhyay & Roth, 1996, Crit, Rev. in Oncogenesis 7, 151-190). 
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We 26H41 1-1418; S^, 1„3. ,„ 89-93; Kumar e, a,.. 995, ,^Eb7^ 
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1183; Breaker, 1996, Curr Op. Biotech,, 7, 442; Santoro et ai, 1997, Proc. Nail Acad, 
ScL, 94, 4262; Tang etaL 1997, RNA 3, 914; Nakamaye & Eckstein, 1994, supra; Long & 
Uhlenbeck, 1994, supra; Ishizaka et ah, 1995, supra; Vaish et ai, 1997, Biochemistry 36, 
6495; all of these are incorporated by reference herein). Each can catalyze a series of 
reactions including the hydrolysis of phosphodiester bonds in trans (and thus can cleave 
other RNA molecules) under physiological conditions. Table I summarizes some of the 
characteristics of some of these ribozymes. In general, enzymatic nucleic acids act by first 
binding to a target RNA. Such binding occurs through the target binding portion of an 
enzymatic nucleic acid which is held m close proximity to an enzymatic portion of the 
molecule that acts to cleave the target RNA. Thus, the enzymatic nucleic acid first 
recognizes and then binds a target RNA through complementary base-pairing, and once 
bound to the correct site, acts enzymatically to cut the target RNA. Strategic cleavage of 
such a target RNA will destroy its ability to direct synthesis of an encoded protein. After 
an enzymatic nucleic acid has bound and cleaved its RNA target, it is released from that 
RNA to search for another target and can repeatedly bind and cleave new targets. 

The enzymatic nature of a ribozyme is advantageous over other technologies, since 
the concentration of ribozyme necessary to affect a therapeutic treatment is lower. This 
advantage reflects the ability of the ribozyme to act enzymatically. Thus, a single 
ribozyme molecule is able to cleave many molecules of target RNA. In addition, the 
ribozyme is a highly specific inhibitor, with the specificity of inhibition depending not 
only on the base-pairing mechanism of binding to the target RNA, but also on the 
mechanism of target RNA cleavage. Single mismatches, or base-substitutions, near the 
site of cleavage can be chosen to completely eliminate catalytic activity of a ribozyme. 

Nucleic acid molecules having an endonuclease enzymatic activity are able to 
repeatedly cleave other separate RNA molecules in a nucleotide base sequence-specific 
manner. Such enzymatic nucleic acid molecules can be targeted to virtually any RNA 
transcript, and efficient cleavage achieved in vitro (Zaug et al, 324, Nature 429 1986 ; 
Uhlenbeck, 1987 Nature 328, 596; Kim et al., 84 Prpc. Natl Acad Sci USA 8788, 1987; 
Dreyftis, 1988, Einstein Quart. 1 Bio. Med,, 6, 92; Haselofif and Gerlach, 334 Nature 585, 
1988; Cech, 260 JAhdA 3030, 1988; and Jefferies et^l., 17 Nucleic Acids Research 1371, 
1989; Santoro et al, 1997 supra). 

Because of their sequence-specificity, rraw^-cleaving ribozymes show promise as 
therapeutic agents for human disease (Usman & McSwiggen, 1995 Ann. Rep. Med. Chem, 
30, 285-294; Christoffersen and Marr, 1995 J. Med Chem. 38, 2023-2037). Ribozymes 
can be designed to cleave specific RNA targets within the background of cellular RNA. 
Such a cleavage event renders the RNA non-functional and abrogates protein expression 
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from that RNA. In this manner, synthesis of a protein associated with a disease state can 
be selectively inhibited. 

Optimizing Ribozvme Activity 

Catalytic activity of the ribozymes described in the instant invention can be 
optimized as described by Draper et al., supra. The details will not be repeated here, but 
include altering the length of the ribozyme binding anns, or chemically synthesizing 
ribozymes with modifications (base, sugar and/or phosphate) that prevent their 
degradation by serum ribonucleases and/br enhance their enzymatic activity (see e.g., 
Eckstein et al.. International Publication No. WO 92/07065; Peirault et al., 1990 Nature 
344, 565; Pieken et al., 1991 Science 253, 314; Usman and Cedergren, 1992 Trends in 
Biochem. Set 17, 334; Usman et al., International PubUcation No. WO 93/15187; and 
Rossi et al.. International Publication No. WO 91/03162; Sproat, US Patent' No. 
5,334,711; and Burgin et al, supra; all of these describe various chemical modifications 
that can be made to the base, phosphate and/or sugar moieties of enzymatic RNA 
molecules). Modifications which enhance their efficacy in cells, and removal of bases 
from stem loop strucnires to shorten RNA synthesis times and reduce chemical 
requirements are desired. (AH these publications are hereby incorporated by reference 
herein). 

There are several examples in the art describing sugar, base and phosphate 
modifications that can be introduced into enzymatic nucleic acid molecules without 
significantly effecting catalysis and with significant enhancement in their nuclease 
stability and eflScacy. Ribozymes are modified to enhance stability and/or enhance 
catalytic activity by modification with nuclease resistant groups, for example, 2'-amino, 2'- 
C-allyl, 2'-flouro, 2'-0-methyl, 2'-H, nucleotide base modifications (for a review see 
Usman and Cedergren, 1992 17, 34; Usman e/a/.. 1994 Nucleic Acids Symp. Ser. 31, 
163; Burgin et al., 1996 Biochemistry 35, 14090). Sugar modification of enzymatic 
nucleic acid molecules have been extensively described in the art (see Eckstein et al.. 
International Publication PCT No. WO 92/07065; Perrault et al Nature 1990, 344, 565- 
568; Pieken et al. Science 1991 , 253, 314-3 17; Usman and Cedergren, Trends in Biochem. 
Sci. 1992, 77, 334-339; Usman e/fl/. International Publication PCT No. WO 93/15187; 
Sproat, US Patent No. 5,334,711 and Beigelraan et al.. 1995 J. Biol Chem. 270, 25702,' 
all of the references are hereby incorporated in their totality by reference herein). Such 
publications describe general methods and strategies to determine the location of 
incorporation of sugar, base and/or phosphate modifications and the like into ribozymes 
without inhibiting catalysis, and are incorporated by reference herein. In view of such 
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teachings, similar modifications can be used as described herein to modify the nucleic acid 
catalysts of the instant invention. 

Nucleic acid catalysts having chemical modifications which maintain or enhance 
enzymatic activity are provided. Such nucleic acid is also generally more resistant to 
nucleases than unmodified nucleic acid. Thus, in a cell and/or in vivo the activity may not 
be significantly lowered. As exemplified herein such ribozymes are useful in a cell and/or 
in vivo even if activity over all is reduced 10 fold (Burgin et al, 1996, Biochemistry, 35, 
14090). Such ribozymes herein are said to "maintain" the enzymatic activity on all RNA 
ribozyme. 

Therapeutic ribozymes delivered exogenously must optimally be stable within cells 
until translation of the target RNA has been inhibited long enough to reduce the levels of 
the undesirable protein. This period of time varies between hours to days depending upon 
the disease state. Clearly, ribozymes must be resistant to nucleases in order to function as 
effective intracellular therapeutic agents. Improvements in the chemical synthesis of RNA 
(Wincott et al, 1995 Nucleic Acids Res. 23, 2677; incorporated by reference herein) have 
expanded the ability to modify ribozymes by introducing nucleotide modifications to 
enhance their nuclease stability as described above. 

By "enhanced enzymatic activity" is meant to include activity measured in cells 
and/or in vivo where the activity is a reflection of both catalytic activity and ribozyme 
stability. In this invention, the product of these properties is increased or not significantly 
(less that 10 fold) decreased in vivo compared to an all RNA ribozyme. 

In yet another preferred embodiment, nucleic acid catalysts having chemical 
modifications which maintain or enhance enzymatic activity is provided. Such nucleic 
acid is also generally more resistant to nucleases than unmodified nucleic acid. Thus, in a 
cell and/or in vivo the activity may not be significantly lowered. As exemplified herein 
such ribozymes are useful m a cell and/or in vivo even if activity over all is reduced 10 
fold (Burgin et aL 1996, Biochemistry, 35, 14090): Such ribozymes herein are said to 
"maintain" the enzymatic activity on all RNA ribozyme. 

Inhibition Of Estrogen Receptor Gene Expression 

Breast Cancer is one of the leading causes of death in women (Jiang and Jordan, 
1992, J. Natl Cancer Inst, 84, 580-591). There has been an intense effort to understand 
the molecular mechanisms for hormonal regulation^ of cell proliferation in breast cancer 
over the last several decades. It has been shown that many breast and endometrial cancers 
are dependent on estrogen for their growth and progression (Borras et al, 1994, J, Steroid 
Biochem. Molec, Biol 48, 325-336). Estrogen receptor plays a pivotal role in these cancers 
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and thus controlling the expression of this gene is of paramount intcn^st to researcher, and 
chmaans. The estrogen receptor is a member of the steroid hormone receptor gene family 
that displays it's biological function as a ligand binding-dependent transcription factor 
Tamoxifen ,s a nonsteroidal antiestrogcn which treats ail stages of breast cancer and may 
' preventative compound in those predisposed to breast cancer (Jordan and 
Murphy, 1990, /?ev. 1 1; 578-610). 

Most bi^ast tumors are initially dependem upon estrogen for growth, and the 
. ^trogen receptor has been a key indicator for endocrine response, prognosis and survival 
from breast cancer. The MCF-y human br^t cancer cell line expresses high levels of 
10 estrogen receptor and is responsive to the effects of added estrogen (Borras et al 1996 J. 
S'eroidBiochem. Molec. Biol 57, 203-213; Pink and Jordan, 1996, Cancer Res. 56, 232U 
2330). They are an excellent model system to smdy tlie effects of regulation of estrogen 
receptor m breast cancer. Ribozymes and antisense oUgonucleotides .^present a direct 
means of affecting the levels of estrogen receptor message. In estrogen dependem cell 
15 hues, decreased amounts of estrogen receptor transcript should lower overall amounts of 
estrogen receptor protein and prevent proliferation of those cells. The effects of estrogen 
receptor on sexual differentiation of brain has been examined using antisense 
oligonucleotides (McCarthy et al., 1993 Endocrinology 133, 433-439). This application 
docurnents the effects of ribozymes and antisen^ oligonucleotides to estrogen receptor 
RNA levels and proliferation in MCF-7 cells. 

Estrogen receptor may be inhibited using nucleic add molecules, including the 
nucleic acid molecules of the present invention. Other references describe the use of 
antisense molecules to down regulate estrogen receptor RNA (Defazio et al 1997 Cell 
Growth Differ. 8, 903-911; Santagati et al., 1997, Mol. Endocrinol. 11. 938-949- WUliard 
25 etal., m4,Gene 149,21-24; Jiang & Jordan, ^«prfl). 

The nucleic acid molecules may be chemically synthesized and delivered using 
methods described above, or may be expressed within cells from eukaryotic promoters 
(e.g.. Izant and Weintraub, 1985 Science 229, 345; McGany and Lmdquist, 1986 Proa 
Natl. Acad Sci. USA 83, 399; Scanlon etal., 1991, Proc. Natl. Acad Sci. USA 88 10591- 
5; Kashani-Sabete/a/.. 1992 Antisense Res. Dev.. 2,3-15; DropnWc etal 1992 J: Virol. 
66,J432-41; Weerasinghe a/., 1991J F/.o/, 65, 5531-4; Ojwang 1992 Proc. 

Natl. Acad Sci. USA 89, 10802-6; Chen et al.. 1992 Nucleic Acids Res ' 20 458I-9' 
Sarverero/.. 1990 247, 1222-1225; Thompson ./., 1995 Nucleic Acicb Res 23 

2259; Good al.. 1997, Gene Therapy, 4, 45; all of the references are hereby incorporated' 
in their totality by reference herein). Those skilled in the art realize that any nucleic acid 
can be expressed in eukaryotic cells from the appropriate DNA/RNA vector The activity 
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of such nucleic acids can be augmented by their release from the primary transcript by a 
ribozyme (Draper et ai, PCX WO 93/23569, and Sullivan et al., PCT WO 94/02595; 
Ohkawa^/aA, 1992 Nucleic Acids Symp. Ser, 27,15-6; Tairae/a/., 1991, Nucleic Acids 
Res,, 19,5125-30; Ventura e/ a/., 1993 Nucleic Acids Res., 2], 3249-55; Chov/rira et ai, 
5 1994 J. Biol. Chem. 269, 25856; all of the references are hereby incorporated in their 
totality by reference herein). 

One type of nucleic acid molecules knownras ribozymes, which can cleave target 
molecules, are expressed from transcription units (see for example Couture et ai, 1996, 
TIG,, 12, 510) inserted into DNA or RNA vectors.:The recombinant vectors are preferably 

10 DNA plasmids or viral vectors. Ribozyme expressing viral vectors could be constructed 
based on, but not limited to, adeno-associated virus, retrovirus, adenovirus, or alphavirus. 
Preferably, the recombinant vectors capable of expressing the ribozymes are delivered as 
described above, and persist in target cells. Alternatively, viral vectors may be used that 
provide for transient expression of ribozymes. Such vectors might be repeatedly 

15 administered as necessary. Once expressed, the ribozymes cleave the target mRNA. The 
active ribozyme contains an enzymatic center or core equivalent to those in the examples, 
and binding arms able to bind target nucleic acid molecules such that cleavage at the 
target site occurs. Other sequences may be present which do not interfere with such 
cleavage. Delivery of ribozyme expressing vectors could be systemic, such as by 

20 intravenous or intramuscular administration, by administration to target cells ex-planted 
from the patient followed by reintroduction into the patient, or by any other means that 
would allow for introduction into the desired target cell (for a review see Couture et ai, 
1996, 77G.. 12,510). 

In one aspect the invention features, an expression vector comprising nucleic acid 
25 sequence encoding at least one of the nucleic acid molecules of the instant invention is 
disclosed. The nucleic acid sequence encoding the nucleic acid catalyst of the instant 
invention is operable linked in a manner which allows expression of that nucleic acid 
molecule. 

In another aspect the invention features, the expression vector comprises: a 
30 transcription initiation region (e.g., eukaryotic pol I, II or III initiation region); b) a 
transcription termination region (e.g., eukaryotic pol I, II or III termination region); c) a 
gene encoding at least one of the nucleic acid catalyst of the instant invention; and wherein 
said gene is operably linked to said initiation region and said termination region, in a 
manner which allows expression and/or delivery of said nucleic acid molecule. The vector 
35 may optionally include an open reading frame (ORF) for a protein operably linked on the 
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5- side or the 3'-side of the gene encoding the nucleic acid catalyst of the invention; and/or 
an intron (intervening sequences). 

Transcription of the ribozyme sequences are driven from a promoter for eukaryotic 
RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase III (pol III). 
Transcripts from pol II or pol III promoters will be expressed at high levels in all cells; the 
levels of a given pol II promoter in a given cell type will depend on the nature of the gene 
regulatory sequences (enhancers, silencers, etc.) presem nearby. Prokaryotic RNA 
polymerase promoters are also used, providing that the prokaryotic RNA polymerase 
enzyme is expressed in the appropriate cells (Elroy-Stein and Moss, 1990 Proc. Natl. 
Acad. Sci. USA, 87, 6743-7; Gao and Huang 1993 Nucleic Acids Res., 21, 2867-72; 
Lieber et al., 1993 Methods Enzymol. 217, 47-66; Zhou et al., 1990 Mol. Cell. Biol., 10, 
4529-37). Several investigators have demonstrated that ribozymes expressed from Lh 
promoters can fimction in mammalian cells (e.g. Kashani-Sabet et al., 1992 Antisense 
Res. Dev.. 2, 3-15; Ojwang et al., 1992 Proc. Natl. Acad Sci. USA, 89, 10802-6; Chen 
et al., 1992 Nucleic Acids Res., 20, 4581-9; Yu et al., 1993 Proc. Natl. Acad. Sci. US A, 
90,6340-4; L'HuUlier a/., 1992 £M50 J. 11, 441 1-8; Lisziewicz e/ a/., 1993 Proc. 
Natl. Acad Sci. U. S A., 90, 8000-4; Thompson etai, 1995 Nucleic Acids Res. 2i, 2259; 
Sullenger & Cech, 1993, Science, 262, 1566). More specifically, transcription unite such 
as the ones derived from genes encoding U6 small nuclear (snRNA), transfer RNA 
(tRNA) and adenovirus VA RNA are usefiil in generating high concentrations of desired 
RNA molecules such as ribozymes in cells (Thompson et al, supra; Couture and 
Stinchcomb, 1996, supra; Noonberg etal. \99A, Nucleic Acid Res., 22, 2830; Noonberg et 
al. US Patent No. 5,624,803; Good et al, 1997, Gene Ther 4, 45; Beigelman et al 
International PCT Publication No. WO 96/18736; all of these publications are incorporated 
by reference herein. The above ribozyme transcription units can be incorporated into a 
variety of vectors for introduction into mammalian cells, includmg but not restricted to, 
plasmid DNA vectors, viral DNA vectors (such as adenovirus or adeno-associated virus 
vectors), or viral RNA vectors (.such as retroviral or alphavirus vectors) (for a review see 
Couture and Stinchcomb, 1996, supra). 

In yet another aspect the invention features an expression vector comprising 
nucleic acid sequence encoding at least one of the catalytic nucleic acid molecule of the 
invention, in a manner which allows expression of that nucleic acid molecule. The 
expression vector comprises in one embodiment; a) a transcription initiation region; b) a 
transcription termination region; c) a gene encoding at least one said nucleic' acid 
molecule; and wherein said gene is operably linked to said initiation region and said 
termination region, in a manner which allows expression and/or delivery of said nucleic 



wo 99/54459 



PCT/US99/08547 



27 

acid molecule. In another preferred embodiment the expression vector comprises: a) a 
transcription initiation region; b) a transcription termination region; c) an open reading 
frame; d) a gene encoding at least one said nucleic acid molecule, wherein said gene is 
operably linked to the 3 -end of said open reading frame; and wherein said gene is 
5 . operably linked to said initiation region, said open reading frame and said termination 
region, in a manner which allows expression and/or delivery of said nucleic acid molecule. 
In yet another embodiment the expression vector comprises: a) a transcription initiation 
region; b) a transcription termination region; c) an intron; d) a gene encoding at least one 
said nucleic acid molecule; and wherein said gene is operably linked to said initiation 

10 region, said intron and said termination region, in a manner which allows expression 
and/or delivery of said nucleic acid molecule. In another embodiment, the expression 
vector comprises: a) a transcription initiation region; b) a transcription termination region; 
c) an intron; d) an open reading frame; e) a gene encoding at least one said nucleic acid 
molecule, wherein said gene is operably linked to the 3*-end of said open reading frame; 

15 and wherein said gene is operably linked to said initiation region, said intron, said open 
reading frame and said termination region, in a manner which allows expression and/or 
delivery of said nucleic acid molecule. 

Target Validation 

One of the most challenging tasks in drug discovery is the choice of a therapeutic 

20 target. Historically, traditional biochemical and other studies have offered limited 
information in this regard. However, recent advances in genomics offer the potential to 
revolutionize both the speed and certainty of therapeutic target identification. Progress in 
characterizing the genes in the human genome has been very rapid, and it is now estimated 
that the entire complement of genes in the human genome may be sequenced before the 

25 end of this century. However, this mass of information is coming to the scientific world 
without a road map. Converting pure gene sequence information into a functional 
understanding of their role in human disease is proving to be a much more difficult 
problem. Even after a group of genes is associated with a particular disease, the process of 
validating which genes are appropriate for use as therapeutic targets is often slow and 

30 costly. Most companies with genomics activities now have access to myriad partial or full 
sequences, but do not possess adequate technologies to determine which of those 
sequences is an appropriate therapeutic target. As a result, only a few genes have been 
imequivocally identified as the causative agent for a specific disease. 

The nucleic acid molecules of the present invention can inhibit gene expression in 

35 a highly specific manner by bmding to and causing the cleavage of the mRNA 
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con-esponding to the gene of interest, and thereby prevent production of the gene product 
(Chnstoffersen, Nature Biotech, 1997. 2, 483-484). Appropriate delivery vehicles can be 
combined with these nucleic acid molecules (including polymers, cationic lipids 
liposomes and the like) and delivered to appropriate cell culture or in vivo animal disease' 
5 models as described above. By monitoring inhibition of gene expression and correlation 
with phenotypic results, the relative importance of the particular gene sequence to disease 
pathology can be established. The process may be both fast and highly selective and 
allow for the process to be used at any point in the development of the organism ' The 
novel chemical composition of these nucleic acid molecules may allowed for added 
1 0 stability and therefore increased efficacy. 

Examples 

The following are non-limiting examples demonstrating the utility of the nucleic 
acid molecules of the instant invention. Hiose m the art will recognize that certain 
experimental conditions such as temperatures, reaction times, media conditions 
transfection reagents and RNA assays are not meant to be limiting and can be readily 
modified without significantly altering the protocols. 

gxgmple 1 : Identifiration of Potential Nucl eic Acid Mnlernle Binding Site^ 

the sequences of target RNAs were screened for accessible sites using a computer 
folding algorithm. Regions of the mRNA that did not foim secondaiy folding structures 
were identified. For ribozyme sites, regions of mRNA that did not form secondary 
structure and contained potential hammerhead and/or haripin cleavage sites were 
identified. 

Example 2: Selection of Rihozvme r.leav5.p^ Sites in Rsti^p ^ Receptor RNA 

To test whether the sites predicted by the computer-based RNA folding algorithm 
corresponded to accessible sites in estrogen receptor. Ribozyme target sit«s were chosen 
by analyzing genomic sequences of Genbank Sequence HSERRl (Green et al 1986 
Nature 320, 134-139) and prioritizing the sites on the basis of folding. Ribozymes were' 
designed that could bind each target (see Figure 3) and were individually analyzed by 
computer folding (Christoffersene/ a/., 1994 J. Mol. Struc. Theochem, 311 273- Jaeger 
a/., 1989, Proc. Natl. Acad. ScL USA, 86. 7706) to assess whether the ribozyme 'sequences 
fold mto the appropriate secondary structure. Those ribozymes with unfavorable 
intramolecular interactions between the binding arms and the catalytic core were 
ehmmated from consideration. As noted below, vaiying binding ami lengths can be 
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chosen to optimize activity. Generally, at least 5 bases on each arm are able to bind to, or 
otherwise interact with, the target RNA. Hammerhead (Seq. ID. Nos. 1-1245) and haiqjin 
ribozymes (2491-2603) are listed in tables IV and V respectively. 

Example 3: Inhibition of c-raf RNA Targets using Nucleic Acid Molecules 
5 Prostate cancer cells (PC-3) were grown in a growth media consisting of Kaighn's 

F-12K media, 10% FBS, 1% glutamine, 20 mM HEPES, and 1% pen/strep to sub- 
confluent densities. A 4X concentration (10 Hg/mL) of GSV (Glen Research) was 
prepared from a 2 mg/mL stock solution as well as . a 10(iM solution of antisense and its 
scrambled (mismatch) control. Complexes of antisense and GSV were formed in a 96 

10 well plate by channel pipetting in antisense and GSV to form complex solutions which are 
twice the final concentrations. 50 ^iL of the complex solution and 50 ^iL of growth 
medium (without antibiotics) were added to PC-3 cells and incubated for 24 hours. The 
filial concentrations of antisense used were 400, 200, and 100 nM, while the GSV 
concentration was held constant at 2.5 |ig/mL. PC-3 cells were then harvested with 150 

15 of RLT lysis buffer (Qiagen). RNA was purified using Qiagen's instructions and RNA 
was quantified using Taqman reagents and the 7700 Prism (Perkin Elmer) using the 
manufacturer's protocol. The c-raf RNA concentration was normalized to the c-raf RNA 
concentrations of the scrambled controls. The antisense sequence (Seq. I.D. No. 2717) 
and the data is shown in table III A and figure 5. The antisense molecules were capable of 

20 reducing c-raf RNA levels up to 80% compared to the mismatch control in PC-3 cells at 
several concentrations of antisense molecules. 

Example 4: Ribozvme in vitro Cleavage Assay 

Ribozymes and complementary substrates were synthesized as described above. 
These ribozymes can be tested for cleavage activity in vitro, for example using the 

25 following procediu-e. The ribozyme sequences are shown in figure 7. 

Cleavage Reactions: Full-length or partially fiill-length, internally-labeled target 
RNA for ribozyme cleavage assay was prepared by in vitro transcription in the presence of 
[a-32p] cTp^ passed over a G 50 Sephadcx column by spin chromatography and used as 
substrate RNA without fiirther purification. Alternately, substrates may by 5 -32p-end 

30 labeled using T4 polynucleotide kinase enzyme. Assays were performed by pre-warming 
a 2X concentration of purified ribozyme in ribozyrne cleavage buffer (50 mM Tris-HCl, 
pH 7.5 at 37'*C, 10 mM MgCl2) and the cleavage reaction was initiated by adding the 2X 
ribozyme mix to an equal volume of substrate RNA (maximum of 1 -5 nM) that was also 
pre-warmed in cleavage buffer. The assays were carried out for 1 hour at 37°C using a 
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final concentration of either 1 ^M ribozyme, /.e, ribozyme excess. The reaction was 
quenched by the addition of an equal volume of 95% formamide, 20 mM EDTA, 0.05% 
bromophenol blue and 0.05% xylene cyanol after which the sample was heated to 95°C for 
2 minutes, quick chilled and loaded onto a denaturing polyacrylamide gel. Substrate RNA 
and the specific RNA cleavage products generated by ribozyme cleavage were visualized 
on an autoradiograph of the gel. The percentage of cleavage was determined by Phosphor 
Imager® quantitation of bands representing the intact substrate and the cleavage products. 
The ribozymes were able to cleave between 1-80.8% of their complementary substrates 
after 2 hours (table VI). 

Example 5: Inhibition of Cell Proliferation Using Estrogen Receptor Targeted Ribozvme 

MCF-7 cells were were grown in a T-TS flask to 80% confluency in growth media, 
which was prepared by mbcing 500 mL Alpha-MEM, 10 mL IM Hepes, 5mL sodium 
Pyruvate, 5 mL NEAA, 5 mL L-glutamine, 250 ^iL 2.0 mg/ml insulin, 500 iil Gentamycin, 
and 10% FBS followmg by sterile filtration. 

Ribozyme and cationic lipid was mixed as described in example 4 with final 
concentrations of 20, 40, and 80 nM ribozyme and I ng/mL GSV. The MCF-7 cells were 
treated with serum free alpha-MEM 24 hours prior to exposure to ribozyme/transfection 
reagent complexes. The complexes were added to the cells and allowed to continously 
transfect for 1,2 , 3, 4, and 5 days. At the end of each time point, the media was removed 
off the cells and prohferation was measured using a CyQuant kit (Molecular Probes). 
Fluorescence was measured at after 10 minutes of incubation at 485 nm (excitation) and 
538 (emission). Inhibition of cellular proliferation by active ribozyme was compared to 
inactive scrambled ribozyme controls. The sequence for the active ribozyme is given as 
sequence ID. No. 2725 and the inactive scrambled control is given as Seq. ID. No. 2726. 
The chimeric enzymatic nucleic acid was able to inhibit MCF-7 cellular proliferation at all 
of the tested concentrations (figure 4). 

Example 6: Inhibition of Estrogen Receptor RNA Targets using antisense nucleic acid 
molecules 

A 5X concentrated solution of oligonucleotide ( l|iM) and a lOX solution of GSV 
(25 |ig/mL) (Glen Research) was made prior to complexing. The 5X oligonucleotide 
solution (8 ^L), lOX GSV solution (40 ^L), and Optimem media (32 fiL) were mixed 
together and incubated at 37°C for 15 minutes. Media was aspirated off the cells followed 
by the addition of 80 ^iL of fresh growth media (Optimem media with 1 0% FBS, 1% non- 
essential amino acids, 1% sodium pyruvate, 20mM HEPES, 1 [ig/mL insulin) and 20 \iL 
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of 5X complex solution. The complex was left on the cells for 20 hours and then 
harvested with 150 fiL of RLT lysis buffer (Qiagen). RNA levels were then quantified 
using Taqman reagents and the 7700 Prism (Perkin Ebner) using the manufacturer's 
protocol. This cellular delivery assay was used for several RPI targets in varying cell lines 
5 and the data is shown in table IIIA. The levels of target RNA were either normalized to 
mismatch control RNA or to an intemal housekeeping gene such as actin. Antisense 
nucleic acid molecules, which are given as Seq. ID. Nos. 2718-2723, were able to knock 
down estrogen receptor RNA by varying degrees. The levels of RNA inhibition ranged 
from 48-64% depending on the antisense sequence. 

10 Example 7: Inhibition of Estrogen Receptor RNA using Ribozvmes 

Ribozymes and GSV transfection reagents were mixed together using a protocol 
similar to the one found m example 6. Target RNA was purified using a Qiagen kit. RNA 
levels were then quantified using Taqman reagents and the 7700 Prism (Perkin Elmer) 
using the manufacturer's protocol. The ribozyme specific to estrogen receptor is given as 

1 5 sequence ID. No. 2724 and was able to inhibit the gene by 50%. 

Example 8: Inhibition of c-Raf mRNA and Cellular Proliferation IJsmp Antisense Nucleic . 
Acid Molecules 

24^ Hour RNA endpoint assay: The effect of targeting intron, exon, and intron-exon 
junction sites with antisense molecules was tested on c-raf RNA using in vitro cell culture 

20 assays. Seven chimeric antisense oligonucleotides with various chemical modifications , 
were tested and the results were compared to an untreated control and an oligonucleotide • 
modified at every nucleotide position with a phosphorothioate modification. Prostate 
cancer cells (PC-3) were grown in growth media consisting of Kaighn's F-12K media, 
10% FBS, 1% glutamine, 20 mM HEPES, and 1% pen/strep in 96 well plates (15,000 cells 

25 per well). For the 24 hour delivery experiments, 2.5 mg/mL of GSV transfection reagent 
(Glen Research) was complexed to either 200 or 400 nM concentration of antisense 
molecules. The complex was left on the cells for 24 hours and the cells were then 
harvested with 150 ^iL of RLT lysis buffer (Qiagen). RNA levels were quantified using 
Taqman reagents and the 7700 Prism (Perkin Elmer) using the manufacturer's protocol. 

30 C-raf RNA levels were normalized to actin controls and the data is shown in Figure 8. All 
compounds demonstrated an ability to inhibit c-raf message. Of the compounds tested, 
sequence I.D. 2732, 2736 and 2737 seemed particularly effective. 

Sustained delivery RNA endpoint assay: Prostate cancer cells (PC-3) were grown in 
growth media consisting of Kaighn's F-12K media, 10% FBS, 1% glutamine, 20 mM 
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HEPES, and \% pen/strep in 96 well plates (15,000 cells per well). The cells were plated 
to 2500 cells per well in a 96 well plate and incubated at 37»C. Antisense nucleic acid 
molecules (Seq. I.D. Nos. 2738, 2737, 2744; Table IV) and GSV transfcction reagent were 
complexed as in described under example 4 to a final concentration of anUsense molecules 
of 100, 200 or 400 nM with 1.0 Mg/mL of GSV tr^fection reagent. For seq I D No 
2744, a mismatch control with similar base composition was also tested as a control The 
complex was left on the cells and allowed to continuously transfect for 1 3 or 5 days At 
the end of each time period, the cells were harvested with 150 of RLT lysis buffer 
(Qiagen); and RNA levels quantified using Taqman reagents and the 7700 Prism (Perkin 
Ebner) usmg the manufecturer's protocol. He inhibition of c-raf expression by molecules 
r^resented by Seq. I.D. Nos. 2738, 2737. and 2744 (table IV) are shown in Figures 9 10 
and 1 1 respectively. All three of these molecules demonstrated the ability to reduce c-ra^ 
RNA between 60-90% compared to the untreated control. 

Proliferation Assay: Chimeric oligonucleotides and delivery reagents were mixed 
15 as descnbed in example 6 where the fmal concentration of antisense oligonucleotide is 200 
nM (Seq. I.D. No.2738 and 2741) and GSV is 1 ,g/mL. A mismatch antisense control was 
also tested (Seq. I.D. No. 2739) in this experiment. The MCF-7 cells were treated with 
serum free alpha-MEM 24 hours prior to exposure to antisense/transfection reagent 
complexes. TTte complexes were added to the cells and allowed to continuously transfect 
tor 1, 3, or 5 days. At the end of each time point, the media was removed off the cells and 
proliferation was measured using a CyQuant kit (Molecular Probes). Fluorescence was 
measured after 10 minutes of incubation at 485 nm (excitation) and 538 (emission) 
Inhibition of cellular proliferation by active oligonucleotide was compared to scrambled 
mismatch controls. The antisense molecules targeting c-raf mRNA were able to inhibit 
25 cellular proliferation by up to 55%. 

Varying Chemical Modifications: Referring to Figure 13 and Table IV alterations 
of the chemical composition of antisense molecules were made while keeping the 
oligonucleotide sequence constant. All of the sequences were the same except for the 
mismatch controls and Seq. I.D. Nos. 2738 and 2739 (table IV) which were two 

30 nucleotides shorter than the others. Antisense molecules and GSV transfection reagent 
were mixed using the protocol described under example 6. The complexes were added to 
.the cells and allowed to continuously transfect for 24 hours. The cells were then harvested 
with 150 nL of RLT lysis buffer (Qiagen). RNA levels were then quantified using 
Taqman reagents and the 7700 Prism (Perkin Ebner) using the manufacturer's protocol 

J5 All c-raf levels were normalized using actin RNA as a control. Referring to Figure 13 
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antisense molecules which utilized phosphorothioate modifications, inverted abasic caps, 
2'-(?-methyl or 2'-0-methylthiomethyl modifications inhibit c-raf mRNA. 

Dose dependent Inhibition: 0, 50, 100 150, and 200 nM of antisense molecules 
(seq. LD. No. 2738 or 2737) were mixed with mismatch control antisense molecules to 
5 give a final antisense/mismatch antisense concentration of 200nM for each sample. The 
antisense nucleic acid and GSV were complexed as in example 6 with a final GSV 
transfection reagent concentration at 2.5 jig/mL. The complexes were added to the cells 
and allowed to continuously transfect for 24 hours. At the end of each time period, the 
cells were harvested with 150 \iL of RLT lysis buffer (Qiagen). RNA levels wei« then 
10 quantified using Taqman reagents and the 7700 Prism (Pericih Elmer) using the 
manufacturer's protocol. All c-raf levels were normalized using actin RNA as a control. 
The results (Figure 14) show that c-raf is inhibited in a dose dependent manner and that 
the IC50 is approximately 35 nM for each of the antisense molecules. 



Example 9: Ribon uclease Protection Assay of MCF-7 Cells treated with Antisense Nucleic 

15 Acid Molecules Targeting BCL-2 

MCF-7 cells were plated in RPMI 1640 (10% FBS, 1% l-glutaraine, 20 mM 
HEPES) at 100,000 cells per well for 24 hours. On the following day, the cells were 
treated with 150 nM of antisense nucleic acid molecules (Seq. I.D. Nos. 2749-2753: Table 
VIII) complexed with 5.4 ^iM of LipofectAMINE (LFA) for 4 hours. The antisense/LFA 

20 complex was then aspirated off and fresh RPMI 1640 media was added to the cells. 24 ' 
hours later the cells were harvested using RLT lysis buffer (Qiagen). A ribonuclease 
protection assay (Ambion) was then performed using the manufacturers protocol to 
quantitate RNA levels and then harvested with 150 ^iL of RLT lysis buffer (Qiagen). 
RNA levels were then quantified using Taqman reagents and the 7700 Prism (Perkin 

25 Elmer) using the manufacturer's protocol. BcI-2 RNA levels were normalized to GAPDH 
controls and is shown in Figure 15. The antisense oligonucleotides specifically inhibited 
Bcl-2 expression in MCF-7 cells. 

Example 10: Inhibition of k-ras in DLD-1 Cells 

96-well plates with DLD-1 ceils at 10,000 cells per well were plated in complete 
30 RPMI 1640 media ((10% FBS, 1% 1-glutamine, 20 mM HEPES, 1% pen/strep). Antisense 
molecules (Seq. I.D. Nos. 2754-2757; Table VIII) were complexed with GSV transfection 
reagent (Glen Research) using the method described in example 6. The final 
concentrations delivered to the cells were 200 nM antisense oligonucleotide and 1.25 
|ig/mL of GSV. The complex was added to the cells for 26 hours and at the end of the 
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time period, the cells were harvested with 150 of RLT lysis buffer (Qiagen). RNA 
levels were then quantified using Taqman reagents and the 7700 Prism (Perkin Elmer) 
using the manufacturer's protocol. All k-ras levels were nonnalized using actin RNA as a 
control. The data (Figure 16) demonstrates that the antisense molecules can inhibit 
approximately 50-90% k-ras expression compared to the untreated or mismatch controls. 

Example 11: Inliihirinn of Estrogen Receptor mRNA TF^inp i^ ntisense MnlPr„l..c nf 
varying Chemical Composition 

Alteration of the chemical composition of antisense molecules were made while 
keeping the oligonucleotide sequence constant. All of the sequences (Table VII- Seq I D 
Nos. 2758. 2760, 2762, 2764, 2766. 2768) were the same except for the mismatch controls 
(Table VII: Seq. I.D. Nos. 2759, 2761, 2763, 2765, 2767, 2769). Antisense molecules and 
GSV transfection reagem was mixed using the protocol described in example 6. The 
complexes were added to MCF-7 cells and allowed to continuously transfect for 24 hours. 
The cells were harvested with 150 nL of RLT lysis buffer (Qiagen). RNA levels were 
15 tiien quantified using Taqman reagents and tiie 7700 Prism (Perkin Ehner) using the 
manufacturer's protocol. All estrogen receptor mRNA levels were normalized using actin 
RNA as a control. The antisense molecules which included phosphorothioate 
modifications, inverted abasic caps, 2'-amethyl or 2'-0-methylthiomethyl modifications 
appeared to decrease estirogen receptor RNA (Figure 17). 
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Example 12: Synthesis of 3'-deoxv-3'-thio a..ano.i n e and k\ r-Thiot)ho.snhnr;,midifP 

Referring to Figures 18, Applicant has developed an efficiem method for tiie 
synthesis of 3'-deoxy.3'-thio guanosine (13) and its 3'-tiiiophosphoramidite 23 from 
guanosine. Reaction of suitably protected guanosine with a-acetoxyisobutyryl bromide 
(a-AIBBr) afforded stereoselectively 3'-deoxy-3'-bromo-2'-0-acetyl-b-D-xylofiiranosyl 
derivative 3 which was converted into the 7:3 mixture of 5-acyl ribofiiranosyl derivatives 
5 (or 6) and 3',4'-unsaturated derivative 4. 5-acylated derivatives 5 and 6 were then 
converted in three steps into 3'-deoxy-3'-5-pyridylsulfanyl-5'-0-(4,4'- 
dimethoxytrityOguanosine (II) which served as a common intennediate for the 
preparation of firee nucleoside 13 and 3 '-thiophosphoramidite 23. 

Oligonucleotides containing 3'-S-phosphorothiolate linkage have attracted 
increasing inlBrest as probes for studying the interaction of nucleic acids and their 
processing enzymes. In particular these analogs have been used in revealing the 
involvement of metal ions in phosphoester transfer reactions catalyzed by RNA (Piccirilli 
et ai, J. Am. Chem. Soc. 1996, J 18, 10341) and ribonucleoprotein enzymes (Sontheimer 
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et al, Nature 1997, 308, 801). The synthesis of 3'-5-phosphorothiolate linked 
deoxyribodinucleotides have been reported using solution chemistry and solid phase 
chemistry (Cosstick et aL, Nucleic Acids. Res. 1990, 18, 829; Li et al, Tetrahedron 1992, 
48, 2729; Li et a/., J. Chem. Soc, Ferkin Trans. 7 1994, 2123; Vyle et al, Biochemistry 
5 1992,31,3012). - 

The synthesis of ribonucleotide 3'-5-phosporothiolate analogs has been limited to 
the preparation of UspU (Liu et ai, Tetrahedron Lett, 1996, 57, 925) and IspU 
(Weinstein et al, J, Am. Chem, Soc, 1996, 118, 10341) dimcrs using solution chemistry. 
Recently, Sun et al {RNA 1997, 3, 1352) described direct incorporation of 3'-5- 

10 phosphorothioamidites into RNA using standard phosphoramidite solid phase synthesis. 

One general approach to the synthesis of 3'-thio ribonucleosides involves 
preparation of 3-thio ribose derivative, followed by the attachment of the desired 
nucleoside base (Ryan et al, J. Org, Chem, 1968, 33, 1783; Cao, et ai, Bioorg. Med 
Chem, Lett. 1994, 4, 807). While glycosylation reactions using pyrimidines proceed in 

15 high yields, purine bases generally give more complex mixtures because both N-7 and N- 
9 of the purine base arc reactive towards glycosylation. Sun et a/., supra, reported the first 
synthesis of 3'-thio guanosine derivatives using the above described approach. Coupling 
of per-acylated 3'-thioribose with persilylated iV^-acetylguanine proceeded in ca 40% 
yield and subsequent synthetic steps proceeded in a low overall yield. 

20 Synthesis of 3'-thio adenosine (Mengel, et al. Tetrahedron Lett, 1977,1 177), 3'- 

thio uridine (Liu et al, 1996 supra) and 3'-thio inosine (Higson et al, Tetrahedron 
1996, 52, 1027) all starting from preformed nucleosides are also reported. 

Applicant describes a novel and improved process for the synthesis of 3'-deoxy-3'- 
thio guanosine (13) and its phosphoramidite 23 from guanosine as a starting material. It 

25 was recently reported (He e/ a/., Tetrahedron Lett, 1995, J9, 6991) that the reaction of 
A^-(dimethylaminomethylene)-guanosine 1 with a-AlBBr (the Mattocks-Moffatt reagent; 
Rtissell et al, J, Am, Chem, Soc. 1973,95, 4025) proceeded stereoselectivcly yielding 
exclusively 3'-bromo-3'-deoxy-b-D-xylofuranosyl derivative. In general, reactions of 
base-unprotected purine nucleosides with this reagent result in the mixtures of trans bromo 

30 acetates of xylo- and arabino* configuration. Applicant ased this reaction on the suitably 
5'-protected A/2_(diniethylaminomethylene)guanosine derivative 2 (Scheme 1; Figure 18). 
5'- protection in 2 helps to reduce the complexity of reaction products by eliminating 
possible formation of the mixture of 5'-0H, 5'-(2,5,5-trimethyl-l,3-dioxolan-4-on-yl) 
and/or 5*-C>-acylated derivatives in reaction with a-AIBBr. This way, identification of the 

35 reaction products becomes straightforward. Applicant has chosen the Nbutyldiphenylsilyl 
(TBDPS) protection because of its relatively high stability towards acidic conditions 
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required during the reaction with a-AIBBr in moist acetonitrile. TMs group is also 
expected to undergo selective cleavage in the presence of i'-acyl groups. Reaction of 1 
with TBDPS-CI proceeded quantitatively to afford the 5'-(>silyl derivative 2 which 
reacted smoothly with a-AIBBr yielding the desired 3'.bromo-3'-deoxy-b-D-xylofuranosyl 
denvative 3 in a high yield. Reaction of 3 with potassium thioacetate or potassium 
Atobenzoate yielded the 3'^-Ac or V-S-Bz derivatives 5 and 6, respectively, along with 
3 .4'-unsaturated derivative 4. The latter is formed by competing elimination reaction 
The ratio was TrS" in favor of the substitution products 5 and 6 which, could not be 
separated from the elimination product 4 at this stage. The mixture of 4 and 5 (or 4 and 6) 
was treated with tetrabutylammonium fluoride (TBAF) buffered with an excess of acetic 
acid; following chromatographic separation the desired 5' de-protected derivative 7 was 
obtamed in a good yield. The misaturated derivative 4 was unstable under the acidic 
reaction conditions and could not be isolated in a pure state by sUica gel column 
chromatography. When triethylamine trihydrofluoride (TEA-SHF) reagent was used for 
1 5 deprotection, desilylation did not proceed to completion. 

It is desirable to keep guanosine derivatives protected with lipophylic groups 
dunng synthetic transformations because of the solubUity problems encountered with 
unprotected derivatives. For that reason 7 and 8 were re-protected in high yields with 
4,4'-dmiethoxytrityI (DMT) group to give the fully protected derivatives 9 and 10 
respectively. DMT group provided a hydrophobic tag which simplified woric-up and 
purification of subsequent synthetic intermediates. Next, 9 and 10 were converted into S- 
(pyridyl-2-disulfenyI) derivative 11 using aqueous methylamine followed by the disulfide 
exchange reaction with 2,2'-dipyridyl disulfide. It is reported that the removal of T-0- 
acyl protection in ribofiuanosyl derivatives similar to 9 and 10 proceeds with difficulty 
Applicant found that 40% aqueous methylamine easily removed all acyl protecting groups 
from 9 and 10 and was the base of choice because, contrary to aqueous ammonia it 
completely solubilised the fiiUy protected substrates. In situ protection of SH as 5-pyridyl- 
2-disulfanyl derivative was achieved using 2.2'.dipyridyl disulfide in DMF to afford 11 in 
85% yield for these two steps. 

In order to synthesize the free nucleoside. 13. 11 was treated with dithiothreitol 
(DTT) in chloroform. When triethylamine was added to the reaction mixture the reaction 
was faster than in it's absence but at the same time 12 was converted into its 5-TEA salt 
Final deprotection of the DMT group of 12 was achieved with IN HCl in methanol in the 
presence of DTT which quenched the released DMT-cation. In the absence of DTF 
quantitative S'-alkylation took place. Applicant is the first to report on the synthesis of the' 
guanosine 3'-tiuo analog 13. Wlien unbuffered TBAF in THF was used to desilylate the 
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mixture of 4 and 5, 5-Ac protecting group was also removed leading to formation of the 
disulfide 14 (Scheme 2; Figure 19). Under these conditions the 3',4'-unsaturated 
derivative 15 remained intact and was separated from 14 by silica gel column 
chromatography. Products were invariably contaminated with TBAF. Attempted 
5 rechromatography of 15 led to its decomposition-, though. Disulfide 14 was 5'-DMT 
protected to afford 16. 

The selective removal of the 3'-acetyl group of 16 (Scheme 2; Figure 19), 
followed by the introduction of 2'-0-r-butylduiiethylsilyl (TBDMS) protection, then 
reduction of 3 '-disulfide and 3'-phosphitylation would be the shortest way to prepare the 

10 desired 3'-thiophosphoramidite building block. Reactions of 16 with mild deacylating 
agents like basic ion exchangers in OH" or CN" form selectively removed 2'-0-acetyI 
protection, but at the same time nucleoside was strongly absorbed on the resin, resulting in 
low recoveries. Applicant used basic treatment followed by S-protection with S-pyridyl 
group, as used for the preparation of the 5-pyridyl-2-disulfanyl derivative 11 fi-om 5- 

1 5 acylated 9 and 10. In this manner 1 1 was obtained from the disulfide 16 in 67% yield. 

The phosphoramidite synthesis is shown in Scheme 3 (Figure 20). Reaction of 11 
with ^,7V-dimetfaylformamide dimethyl acetal yielded the desured ^-protected derivative 
18 in 23% yield. Unfortunately, this reagent also effected the cleavage of the 5-pyridyl 
protection leading to formation of disulfide 17 in 33% yield. 18 was smoothly 2' 

20 protected with TBDMS group using Nbutyldimethylsilyl trifluoromethanesulfonate 
(TBDMS-Tf). Altemativeiy, 1 1 was silylated with TBDMS-Cl to afford 20 and then A^- 
protected using isobutyric anhydride (/-BU2O) in the presence of 4-dimethylaminopyridine 
(DMAP) yielding the fully protected 21. In the absence of DMAP only starting material 
was recovered. On the other hand, reaction of 20 with isobutyryl chloride led to A^-bis- 

25 acylation. Reduction of 21 with DTT afforded 3'-SH derivative 22, which appeared as a 
mixture of two rotamers in NMR. Resonances of the major rotamer were in 
accordance with the ones reported by Sun et aL Phosphitylation of 22 under standard 
conditions afforded 3'-thiophosphoramidite 23. Applicant has described an efficient 
synthesis of 3'-deoxy-3'-thio guanosine. Keeping all synthetic intermediates protected 

30 with lipophylic groups enabled their chromatographic purification and, consequently, a 
good recovery of the products. 

Experimental Section 

General. All reactions were carried out under a positive pressure of argon in 
anhydrous solvents. Commercially available reagents and anhydrous solvents were used 
35 without further purification. *H (400.075 MHz) and ^^P (161.947 MHz) NMR spectra 
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were recorded in CDCI3, unless stated otherwise, and chemical shifts in ppm refer to TMS 
and H3PO4. respectively. Analytical thin-layer chromatography (TLC) was perfomied 
with Merck Art.5554 Kieselgel 60 F254 plates and flash column chromatography using 
Merck 0.040-0.063 mm silica gel 60. Mass spectra were obtained by fast atom 
bombardment method. 

5'-0-r-Bu<yIdiphenyIsiIyl-iV2.(dimethylaminomethyIene)guanosine (2). To a 

stirred solution of A^-(dimethylaminomethylene)guanosine (1) (5.5 g, 16.3 mmol) in 
pyridine (100 mL) /-butyldiphenylsilyl chloride (6.2 ml, 23.8 mmol) was added under 
argon. The reaction mixture was stirred at rt for 16 h, then quenched with methanol ( 20 
-ml) and evaporated to a syrup in vacuo. The residue was crystallized from ethanol-ether ( 
9 g, 96%). mp, 'H NMR (DMSO-d^ + D2O) d 8.46 (s, IH, CH=N). 7.89 (s, IH. H-S), 
7.58-7.31 (m, lOH, Ph), 5.81 (d, J,.^.=4.8, IH, H-1'), 4.46 (app t, Jj- i.=4.8,'iH,'h-2')' 
4.23 (app t, J3..2-5.O, IH, H-3'). 3.97 (m, IH. H-4'), 3.84 (dd. J5.,4.=2.'8. J5. 5«=I2.0, IH,' 
H-5'), 3.74 (dd, J5»4'=4.4, J5»,5-=12.0, IH. H-5"). 3.05 (s. 3H, Mej, 2.97 (s, 3H, Me),'o.94 
(s, 9H. r-Bu), HRMS (FAB+) calcd for C^oHsgNfiO^Si (MH+): calc 577.2595, found 
577.26095. 

I-(2-0-Acelyl.5-0.r.ButyIdiphenykiIy|.3-deosy.3-bn)ino-b-D.xyIofuranosyI)- 
iV2.(dimethylammomethylene)guanosine (3). To a cooled (0 oQ solution of 2 (5.8 g, 10 
mmol) and water (0.12 ml) in acetonitrile (130 mi) a-acetoxyisobutyryl bromide (5.56'ml, 
38 mmol) was added and the mixture was stirred at rt for 3 h. The solution was poured 
into saturated aq. NaHCOs (100 mL) and extracted with CH2CI2 (3 x 200 mL). The 
combined organic layers were dried (Na2S04) and concentrated to give 
chromatographically pure white foam (6 g, 87%), "H NMR d 8.97 (br s, IH, NH), 8.62 (s 
IH, CH=N). 7.82 (s, IH, H-8), 7.73-7.31 (m, lOH, Ph), 6.09 (s, IH, H-2'), 5.92 (d,' 
Jl..2'=1.6, IH, H-r). 4.42 (m, IH, H-4'), 4.36 (m, IH, H-3'), 4.06 (dd, J5. 4'=5.6, 
J5'.5"=I0.4, IH, H-5'). 3.97 (dd, J5..4.=6.4, J5.. 5 =10.4, 111, H-5"), 3.17 (s, 3H, N-Mt), 
3.07 (s, 3H, N-Mt), 2.19 (s, 3H. 0-Ac), 1.07 (s, 9H, t-Bu), HRMS (FAB+) calcd for 
CsiHsyBrNgOsSi (MH+): calc 681.1856, found 681.1850. 

l-(2-0-AcetyI.5.0-/.butyldiphenybilyl-3-deoxy- b -D-gfycero.pent-3- 
enofuranosyl-A2-(dimethyIaminomethylene)guanosine (4) and 2'-0-acctyI-5'-0.^. 

butyldiphenyl-silyl-3'-deoxy-3'^.thioacetyI-A«(dimethylammomethyIene)-guanosine 

(5). 3 (5.4 g, 7.9 mmol) was dissolved in dry DMF (50 ml) and potassium thioacetate (2.7 
g, 23.6 mmol) was added to the solution. The reaction mixture was stirred at 60 "C for 16 
h and then evaporated to a syrup under reduced pressure. The residue was partitioned 
between aq. NaHCOs-brine 1:1 solution and dichloromethane, the organic layer was dried 
(Na2S04), evaporated to dryness and chromatographed on the column of silicagel using 2- 
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10% gradient of methanol in dichloromethane 4 and 5 co-eluted yielding, after 
evaporation, a yellowish foam (4.8 g). NMR indicated a 3:7 ratio of 4 to 5. 

When potassium thiobenzoate was used .. instead of potassium thioacetate an 
inseparable mixture of 4 and 2'-0-acetyl-5'-0-/-butyldiphenykilyl-3'-deoxy-3'-5- 
5 thiobeiizoyl-7V^(diEnethyIaminomethyIene)guanosine (6) was obtained in a similar yield 
and ratio to the unsaturated derivative 4 as above. 

2'-a-Acetyl-3'-deoxy-3'-5-thioacetyl-7V2(diincthylaininometbylene) guanosine 
(7). The above mixture of 4 an 5 (0.9 g) was dissolved in THF (15 ml) and acetic acid 
(0.37 ml, 6.5 nmiol) was added followed by TBAF*3H20 (0.82 g, 2.6 mmol). The 

10 reaction mixture was stirred at rt for 5 h, then diluted with dichloromethane, washed with 
water and 10% aq. NaHC03. Aqueous layers were back-washed with dichloromethane, 
organic layers were combined, dried (Na2S04) and evaporated to dryness. Silica gel 
column chromatography using 2-10% gradient of methanol in dichloromethane afforded 7 
as a yellowish foam (300 mg, ca 74%), 'H NMR d 8.87 (br s, IH, NH), 8.78 (s, IH, 

15 CH=N), 7.73 (s, IH, H-8), 5.80 (d, Jr,2'=2.0, IH, H-l'), 5.76 (dd, Jr v^l.O, J2',3'=6.4, 
IH, H-2'), 4.94 (dd, Jr,2'=6.4, J3M'=9.6, IH, H-3'), 4.22 (d, J4',3>=9.6, IH, H-4'), 4.04 (br 
s, IH, 5'-OH), 3.99 (d, J5',5»=12.0, IH, H«5'), 3.71 (d, Js-^s =12.0, IH, H.5"), 3.19 (s, 3H, 
7^-Me), 3.04 (s, 3H, A^-Me), 2.34 (s, 3H, ^-Ac), 2.13 (s, 3H, 0-Ac), HRMS (FAB"*) calcd 
forC,7H22N606S (MH+): 439.1355, found 439.1405. 

20 2'-0-Acetyl-3'-deoxy-3'-iS-thiobenzoyI-A^(dimethyIaminomethylene) 

guanosine (8). Using the same procedure as above, 8 was synthesized from the mixture of 
4 and 6 in ca 70% yield, 'H NMR d 8.90 (br s, IH, Nil), 8.57 (brs, IH, NH), 7.69 (s, IH, 
H-8), 5,88 (m, 2H, H-r,H.2'), 5.30 (m, IH, H-3'), 4.34 (d, J4^^3-9.2, IH, H^'), 4.04 (d, 
J5',5'^=12.8, IH, H5'), 3.80 (dd, J5^oH=9.6, J5^^.v=12.8, IH, H-5"), 3.69 (br s, IH, 5'- 

25 OH), 3.25 (s, 3H, AT-Me), 3.10 (s, 3H, N-Mc), 2.16 (s, 3H, O-Ac), HRMS (FAB+) calcd 
for C,7H22N606S (MH-^): 501.1556, found 501.1561. 

2'-0-Acetyl-3'-deoxy-3'-5-thioacetyI-5*-0-(:^,^'-^im^/Aoxv/^^ 
A^(dimethylamuio-methylene)guaiiosine (9). 7 (720 mg, 1.64 mmol) was dissolved in 
dry pyridine (15 ml) and DMT-Cl (1.1 g, 3.3 mmol) was added. The reaction mixture 

30 was stirred at rt for 4 h, qenched with methanol and evaporated to a syrup which was 
partitioned between 5% aq. NaHC03 and CH2CI2. Organic layer was washed with brine, 
dried (Na2S04) and evaporated to dryness in vacuo. ..The residue was purified by silica gel 
column chromatography using 1-5% gradient of methanol in dichloromethane to yield the 
product as a colorless foam (0.85 g, 70%), ^H NMR d 8.69 (s, IH, CH=N), 8.58 (br s, IH, 

35 NH), 7.69 (s, IH, H-8), 7.38-6.74 (m, 13H, H-8, aromatic), 6.06 (dd, J2',3»=6.4, J2M-I.2, 
IH, H-2'), 5.82 (d, Jr,2-1.2, IH, H-l'), 4.73 (dd, Jj.4'=10.6, J.r.2=6.4,' IH, H.3'), 4.21 
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5 ), 3.07 (s. 3H, A^-Me), 3.05 (s. 3H, MMe). 2.26 (s. 3H. S-Ac), 2.15 (s 3H 0-Ac) 
HRMS(FAB-)calcd for C38H4oN608S (W): 741.2707, found 741.2692 ' ' 

2'-O.Acetyl.3'Hleoxy-3'-,5-thiobenzoyl-5'.0.(4.4'^imethoxytri(yI). 
7V^(d.me*hyl-aminomethyIene)guanosine (JO). Using similar p„x:edure as described 
above^S was converted into 10 in 69% yield. -H NMR d 8.80 (s, IH, CH=N). 8.65 (br s 
IH. NH), 7.70 (s, IH. H-8), 7.88-6.66 (m, 19H, an,n,atic), 6.17 (d, J^. ,.=5 8 IH H-2')' 

4T3 S^".. t '''' «-^'>: ^-^'^ H- 

4 ) 3.67(s,6H.2xOMe).3.45(m,2H,H-5',H-5"), 3.06 (s. 6H, 2XA^-Me) 2.15 (s 3H O- 
Ac). HRMS (FAB-) calcd for C,3H4oN,08S (MH-): 803.2863, found 803 2855 ' 

3^Deoxy.3'^.pyridybulfa„yI.5.^^4,4'-dimethoxytrityl)-gu^ (11). a 9 
(530 mg, 0.38 mmol) >vas dissolved in 40o/o aqueous methylamine (50 ml) and the mixture 
IS kept at rt for 16 h. The solvent is removed in vacuo and the residual syrup dissolved in 
argon purged DMF (30 ml) containing 2,2'-dipyridyl disulfide (340 mg. 1.54 mmol) The 
reaction mixture is heated at 60 oc for 10 h and then evaporated to a syrup in vacuo 
Column chromatography on silica gel using l-12«/o gradient of methanol in 
dichloromethane afforded 11 as a colorless solid (460 mg, 85o/,), "H NMR d 10.64 (br s 
H, NH). 8.39 (m, IH, Pyr), 7.83 (s, IH, H-8), 7.73-6.72 (m. 16H. an,matic), 6.50 (d 
JOH^ =4^80, IH, 0H-2-). 6.45 (br s, 2H, NH,), 5.81 (d, J,,2^2.4, IH. H-l'). 4.83 (m, IH, 
H-2), 4J4 (m, IH. H-4-), 4.09 (dd, J3.,=6.00, J3,,=7.8, IH. H-3-), 3.70 (s, 6H. 2X0Me), 
3.11 (dd, 15,5 -=11.2, J5.,4.=4.8. IH, H-5"). HRMS (FAB+) calcd for C^.H^.N^O.S. 
(MH^): 71 1.2060, found 71 1 .2076. ^.6" ,4N606S2 

B. Using the same procedure as above, but starting from S-benzoyl derivative 
10, 11 was prepared in 80% yield. 

25 C. Starting from 16 (830 mg, 1.12 mmol) and using the above conditions 

11 (570 mg, 67%) was obtained . 

3'.Deoxy-3'.thio.5'-OK4,4'.dlmethoxytrityl).g„ai.osine(12). To the solution of 
U (240 mg, 0.34 mmol) in chloroform (14 ml) dithiothreitol (DTT) (125 mg 0 81 mmol) 
was added and the reaction mixture was stirred at rt for 3 h. It was then evaporated to a 
30 syrup in vacuo, and the product was precipitated by addition of peroxide-free ether 
precipitate was filtered off. washed with ether and dried (230 mg of the crude material)' 
'H NMR (DMSO-d,) d 10.63 (br s. IH, NH), 7.86 (s, IH, H-8), 7.32-6.80 (m 13H 
aromatic), 6.49 (br s, 2H, NH^), 5.81 (s, IH, H-I'). 4.43 (d, J. 3,=4.8, IH, H.2'), 3 93 (m 
H, H-4-), 3.79 (dd. J3.,=4.8, J3,,.=9.6. IH, H-3-), 3.71 (s. 6H, 2XOMe), 3.16 (dd,' 
35 J5-,.v=10.4.J5..4.=4.8,IH.H.5"). 
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3'-Deoxy-3'-thio-guanosine (13). The mixUire of the crude 12 (230 mg, 0.33 
mmol) and DTT (150 mg) was dissolved in 1 N mcthanolic HCI (12 ml) and the reaction 
mixture was kept at rt for 3 h. It was then concentrated in vacuo and the residue 
coevaporated with toluene two times. Addition of ethyl acetate afforded precipitate which 
was filtered off, washed well with ethyl acetate and dried to afford 13 (90 mg, 79%). The 
product was reprecipitated from water, 'H NMR (CD3OD) d 8.10 (s, IH, H-8), 5.91 (s, 

IH, H-l'), 4.37 (d, j2'.3'=5.2. IH. H-2'), 3.97 (ra, ,2H, H-4', H-5'), 3.82 (dd, Js . 5 =13.0, 
J5",4'=3-4, IH, H-5"), 3.64 (dd, J3'.2'=5.2, 13 ,4 =9.6, IH, H-3'), HRMS (FAB+) calcd for 
C,oH,3N504S (MH+): 300.0767, found 300.0767. , . 

Bis (2-0-acetyl-A'2-(dimethylaininomethyIene)guanosin-3-yl)disullide (14 ) and 

1- (2-0-acetyl-3-deoxy- b -D-^/vc£ro-pent-3-enofurano^l)-iV2-(dimethyl- 
aminomethylene) guanosine (15 ). The mixture of 4 and 5 (4.8 g) was dissolved in THF 
(100 mL) and IM TBAF in THF (10 mL) was added. The reaction mixture was stirred for 
3 h at rt and then evaporated to a syrup in vacuo. Silica gel colunrn chromatography using 

2- 10% gradient of methanol in dichloromethane yielded the faster eluting 15 (1 g, 35% for 
two steps from 3, colorless foam), 'H NMR d 8.96 (br s, IH, NH), 8.57 (s, IH, CH=N), 
7.65 (s, IH, H-8), 6.41 (s, IH, H-2'), 6.04 (s, IH, H-l'), 5.42 (m, IH, H-3'), 4.32 (m, 2H, 
H-5',H-5"), 3.19 (s, 3H, A^-Me), 3.06 (s, 3H, AT-Me), 2.1 1 (s, 3H, Ac). The slower eluting 
14 was obtained as a yellowish solid (0.9 g, 29% for two steps), 'H NMR (DMSO-dg) d 

I I. 34 (br s, IH, NH), 8.53 (s, IH, CH=N), 8.00 (s, IH, H-8), 5.97 (d, Jr,2'=2.4, IH, H-l '), 
5.89 (dd, J2',r=2.4, J2',3-=6.0, IH, H-2'), 5.23 (t, Joh,5'=5.6. IH, 5'-OH), 4.1 1 (m, IH, H- 
4'), 4.02 (dd, J3-,2'=6.0, J3.4.=8.4, IH, H-3'), 3.78 (dm, J5..5"=12.0, IH, H-5'), 3.60 (dm, 
J5«,5'=12.0, IH, H-S"), 3.10 (s, 3H, iV-Me), 3.00 (s, 3H,' A^-Me), 2.06 (s. 3H, O-Ac), 
HRMS (FAB+) calcd for C30H38N, 20,082 (MH+): 791.2354, found 791.2355. 

Bis (2-0-acetyl-5-0-(4,4'diinethoxytrityl)-A/2.(diinethylaniinomethylene) gua- 
nosin-3-yl) disulfide (16). To the solution of 14 (400 mg, 0.5 mmol) in dry pyridine (10 
ml) DMT-Cl (508 mg, 1.5 mmol) was added and the mixture was stirred 4 h at rt. 
Methanol (10 ml) was added and the solution evaporated to dryness. The residue is 
partitioned between saturated NaIIC03 and dichloromethane, organic layer washed with 
brine, dried (Na2S04) and evaporated to a syrup. Silica gel column chromatography using 
2-10% gradient of methanol in dichloromethane yielded product as a yellowish foam (620 
mg, 71% yield), 'H NMR d 8.72 (br s, IH, NH), 8.61 (s, IH, CH=N), 7.48-7.21 (m, 14H, 
H-8, aromatic), 6.25 (d, J2-3-=4.8, IH, H-2'), 5.78 (s, IH, H-l'), 4.00 (m, 2H, H-3', II-4'), 
3.78 (s, 6H, 2xOMe), 3.50 (br s, 2H, H-5',H-5"), 3.i4 (s, 3H, A^-Me), 3.13 (s, 3H, AT-Me), 
1.82 (s, 3H, 0-Ac), HRMS (FAB+) calcd for C72H74N12014S2 (MH+): 1395.4967, 
found 1395.4943. 
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B,s (5-OK4,4 dimethoxytrityI)-A2^dime%laminoinefhylene) guanosin-S-yl). 
disulfide (17). A. 16 (60 mg, 0.04 nunol) is dissolved in dry methanol and ion exchange 
resin AG 1X8 (OH ) (1 g) is added. The mixture was stirred at 55 for 16 h the resin 
was filtered off and washed well with hot methanol. The filtrate was evaporated to 
dryness in vacuo yielding pure 17 as a colorless solid (16 mg, 28%), iH NMR (DMSO-d.) 
d 11.34 (br s, IH, NH), 8.48 (s, IH, CH=N), 7.92 (s, IH, H-8), 7.31-6.73 (m 13H 
aromatic). 6.27 (d, Joh.2-5.2. IH, 2'-0H), 5.88 (d, J, ,,=1.2, IH, H-l'), 4.60 (m IH H- 
2'). 4.16 (m, IH, H-4'), 4.08 (m, IH, H-3'), 3.65 (s, 6H, 2X0Me), 3.65 (m, 2H H-5 ' H- 
5"), 3.02 (s, 3H. JV-Me), 2.97 (s, 3H. N-Mc), HRMS (FAB-) calcd for C.gH^oNnO.'^S, 
(W): 1311.4756, found 1311.4746. " " 

B. Using Amberlyst A-26 (CN-) under the the above reaction conditions 17 
was obtained fcom 16 in 21% yield. 

3^Deoxy-3'^.pyridylsulfanyI.5'.0.(4,4'-dimethoxytrityI)-iV2-(d^^^ 
methyleiie)guanosinc (18) and 17. To the solution of 11 (400 mg. 0.56 mmol) m dry 
pyndme (5 ml), iy^,A^-dimethylformamide dimethyl acetal (1.2 ml, 9 mmol) was added and 
the reaction mixture was stirred at rt for 16 h. Solvents were removed in vacuo and the 
residue chromatographed on the column of silica gel using 1-50% gradient of methanol in 
drchloromediane. Fractions containing the faster rumiing material were combined and 
concentrated in vacuo to give 18 (1 10 mg, 23%), iHNMR d 8:73 (br s, IH, NH) 8 53 (s 
IH, CH=N), 8.49 (m, IH, pyridine), 7.71 (s, IH, H-8), 7.62 (m, IH, pyridine), 7 44-6 79 
(m. 15H, aromatic), 6.09 (s. IH. H-l'), 4.55 (d, J2,3.=4.8, IH, H-2'), 4.23 (dq, J4. .=10 5 
J4'.5-2.8, J,,y=3A, IH, H-4'), 4.14 (dd, J3.,.=4.8, J3..4-IO.5, IH, H-3'), 3.78% 6H 
2XOMe). 3.57 (dd, J5,5"=10.6. J,,,.=2.S. IH. H-5'), 3.41 (dd, J,. ,,.=10.6, J,.. ,,=3 4 1h' 
H-5"), 3.08 (s. 3H, Ar-CH3), 3.05 (s. 3H, N-CH,). Fractions containing the slower running 
compound were collected and evaporated to dryness in vacuo to give 17 as a colorless 
foam (120 mg, 33%). 'H NMR identical to that of 17 obtained obtained using the above 
procedures. 

2'-(?-r-ButyIdimethyIsiIyl-3'-dcoxy-3'^.pyridyIsulfany|.5'-0^4,4'. 
dimethoxytrityI)-A^.(dimethylan,ino-n.ethylene) guanosine (19). 18 (110 mg, 0 14 
mmol) was dissolved in dry pyridine (1 ml) and TBDMS-Tf (0.103 ml, 0.45 mmol) was 
added to the solution. The reaction mixture was stirred at rt for 5 h, then qunched with 
methanol and evaporated to a syrup in vacuo. The residue was dissoved in 
dichloromethane, washed with 5% aqueous NaHC03, then brine and the organic layer was 
dned (NajSO,) and concentrated to the syrup. Column chromatography on silica gel 
using 1-10% gradient of methanol in ethyl acetate afforded 19 as a colorless solid (90 mg 
71%), 'H NMR d 8.69 (br s, IH, NH), 8.50 (s, IH, CH=N), 8.37 (m, IH, pyridine) 7 81' 
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(s, IH, H-8), 7.50-6.74 (m, 16H, aromatic), 5.98 (d, Ji. 2'=2.4, IH, H-1'), 4.75 (dd, 
J2',3'=5.0, J2M =2.4, IH, H-2'), 4.50 (m,lH, H-4'), 3199 (dd, Jj ^^S.O. J3..4.=8.4, IH, H-3'), 
3.76 (s, 6H, 2XOCH3), 3.62 (dd, 15. 5.1=10.9. J5..4-2.2. IH, H-5'), 3.38 (dd, J5.,5 .=10.9, 
J5..,4M.4, IH, H-5"), 3.06 (s, 3H, NCK2), 3.04 (s, 3H, JVCH3), 0.93 (s, 9H, t-Bu), 0.17 (s, 
5 3H, Me). 0. 1 0 (s, 3H, Me), HRMS (FAB^ ) calcd for C45H53N706S2Si (MH+) 880.3346, 
found 880.3357. 

2'-0-/-ButyldimethyIsiIyI-3'-deoxy-3'-5-pyridylsuIfanyl-5'-0-(4,4'- 
dimethoxytrityl) guanosine (20). 3'-Deoxy-3'-5'-pyridyIsulfanyl-5'-C>-(4,4'- 
dimethoxytrityl)guanosine 11 (410 mg, 0.58 mmol) was dissolved in dry pyridine (36 ml) 

10 and imidazole (2.36 g, 35 nunol) and TBDMS-Cl (4.29 g, 28 mmol) were added. The 
reaction was stirred at rt 16 h, then evaporated to a syrup in vacuo. The residue was 
partitioned between dichloromethane and saturated aq. NaHC03, organic layer washed 
with water, dried (Na2S04) and concentrated to a syrup. Column cgromatography on 
silica gel using 1-10% gradient of methanol in dichloromethane afforded the product as a 

15 white foam (430 mg, 85%). "H NMR (DMSO-d^) d 1 1.99 (br s, IH, NH), 8.39 (m, IH, 
Pyr), 7.84 (s, IH, H-8), 7.71 (m, IH, Pyr), 7.62-6.72 (m, 15H, aromatic) 6.41 (br s. 2H, 
NH2), 5.80 (d. Ji ,2 =4.4, IH, H-r), 5.04 (app t, J2'.3'=4.4. IH, H-2'), 4.39 (m, IH, H-4'), 
4.01 (app t, 13. 4 =6.4, IH, H-3'), 3.69 (s, 6H, 2X0Me), 3.13 (dd, J5.-,5.=l 1.0, J5-..4 =4.6, . 
IH, H-5"), 0.82 (s, 9H, /-Bu), 0.08 (s, 3H, Me), 0.06 (s, 3H, Me), HRMS (FAB+) calcd for 

20 C42H48N606S2Si (MH+) 825.2924, found 825.2977. 

2'-0-/-Butyldimethykilyl-3'-deory-3'-5-pyridyIsulfanyl-5'-0-(4,4'- 
dimethoxytrityl)-^-isobutyrylguanosine (21). To the solution of 20 (310 mg, 0.38 
mmol) in dry pyridine (5 ml) isobutyric anhydride (0.19 ml, 1.14 mmol) and DMAP (46 
mg, 0.38 mm) were added and the mixture was stirred at rt 16 h. It was then stirred at 50 

25 °C for 5 h, quenched with methanol (2 ml) and evaporated to a syrup in vacuo. The 
residue was partitioned between dichloromethane and 5% aqueous NaHC03, the organic 
layer was washed with brine, dried (Na2S04) and evi^orated to a syrup. Column 
chromatography on silica gel using 1-5% gradient of methanol in CHiCh afforded 
product as a colorless foam (320 mg, 95%), 'H NMR d 1 1.94 (br s, IH, NH), 8.36 (m, IH, 

30 Pyr), 7.85 (m, IH, Pyr), 7.80 (s, IH, H-8), 7.58-6.71 (m, 16H, NH, aromatic), 5.83 (d, 
J,- 2'=5.2. IH, H-1'), 5.22 (app t, Jr.2 =5.2. IH, H.2'), 4.50 (m, IH, H-4'), 4.27 (app t, 
J3-.4 =6.4, IH, H-3'), 3.76 (s, 3H, OMe), 3.75 (s, 3H, OMe), 3.56 (dd, Js- j-^ILO, 
J5-,4 =1.8, IH, H5'), 2.98 (dd, J5-.,5 =11.0, J5 -.4 =3.0, IH, H-5"), 1.69 (m, IH, C/fMe,). 
0.94 (d, J=7.2, 3H, CH3), 0.76 (d, J=7.2, 3H, CH3), 0:88 (s, 9H, t-B\x), 0.11 (s, 3H, Me), 

35 0.06 (s, 3H, Me), HRMS (FAB+) calcd for C46H54N607S2Si (MH+) 895.3343, found 
895.3380. 
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2'-O^.ButyIdimethyIsily|.3'^leoxy.3^fhio.5VO-(4,4'^imetho3ytrityl)-^^ 
isobalyiylguanosine (22). To the solution of 21 (340 mg. 0.38 mmol) in chlorofom, (20 
ml) TEA (0.4 ml) and dithiotrcitoi DTT (140 mg, 0.91 mmol) were added and the reaction 
mixture was stirred for 1 h at rt. The reaction miture was washed with saturated aqueous 
5 NaHC03. water, dried (Na2S04) and concentrated to a syrup. Silica gel column 
chromatography using 0.5-2% gradiem of methanol in CH.CI. afforded 22 (270 mg 
90%), IH NMR for the major rotamer: d 11 .92 (br s, IH, NH), 7.93 (s, IH, H-8) 7 63-6 80 
(m, I6H, NH, aromatic), 5.83 (d, J, - 2 =2.8, IH, H-I'), 4.74 (dd, J,- ,.=2.8, J.^ 3. =5.4, IH 
H-2'), 4.13 (dd, J4, 3^=7.6, J4-,5 =1.2, IH, H-4'), 3.78 (s, 3H, OMe), 3.73 (sV3H. OMe)' 
10 3.63 (dd. J5.,5"=l 1.0. J5<.4'=1.2. IH. H5'), 3.27 (dd. J5-..5.=l 1 .0, J5.. 4.=3.0, IH. H-5")' 
L63 (d, JsH.3-=8.4. IH, SH), 2.08 (m, IH. CMle,), 1.09 (d, J=6.8, 3H, CH3), 0.98 (d' 
J-6.8, 3H. CH3). 0.91 (s, 9H. /-Bu). 0.14 (s, 3H. Me). 0.08 (s, 3H. Me), HRMS (FAB+) 
calcd for C4 ,H5 , NjOTSSi (MH+) 786.3357. found 786.3354. 

2•-O^.B^tyldimethyIsiIyl.3♦wieoxy-3'.thio-5•.^M4.4•-dimethoxytrityl).^2. 
15 isobutyrylguanosine 3'-^-(2-cyanoetliyl AyV^ihopropylphoramidite (23) 
Phosphitylation of 22 as described by Sun et al afforded product which was purified by 
flash chromatography using 0.5% ethanol in CH.Cl. containing 1% TEA. Final product 
was obtained as a white powder by precipitation from toluene-pentane at 0 oc (76% 
yield), 31p NMR d 163.5 (s). 159.6 (s), HRMS (FAB^) calcd for C,oH68N70«PSSi (MH+) 
^0 986.4435. found 986.4406. - « / » ) 

Example 13: Synthesis of S'. tMoDho.snhate. nn Heoside nhosnhnr^midite and p rep.r.rin« 
of solid supp ort 



Referring to Figure 21, Applicant shows a scheme for synthesis of 5'-deoxy-5'- 
thionucleoside phosphoramidites and succinates. For example Lehmann et al, (NAR 
1989, 17, 2379; incorporated by reference herein) describes the preparation of the more 
base-stable sarcosyl modified solid support. 

Matulic-Adamic. et al. in Nucleosides & Nucleosides 1997, 16, 1933, describes 
the incorporation of 5'-thio modification into oligonucleotides. Applicant additionally 
describes the a method where, the cleavage of the 5'-protecting group on the solid support 
m order to elongate the chain is accomplished by using, instead of AgN03 in CH3CN, 
O.IM l,8-dia2abicyclo[5.4.0]undec-7-ene (DBU) in CH3CN or 20% piperidine in 
CH3CN. 

Note that 5'S-DMT protection is cleaved with iodine (Henningfeld et al. JACS 
1996. 118, 11701) thus complicating the oligo synthesis, while S-Fm is reported to be 
resistant to 0.1 M iodine in DMF. 
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Oligonucleotides were synthesized with 5'-thiophosphate linkage and tested in 
vitro in a standard RNase H cleavage assay described in McSwiggen, US Patent No. 
5,525,468; incorporated by reference herein. ■ 

Example 14: Synthesis o f 5'-0-dimethoxvtritvl-3^deoxv-3^-thio-3'-5-r2-Cvanoethvl-KN- 
Diisoprop vlphosphoramidite-2'-0-methvl iiridme (6) ^Figure 22^ 

S'-O'tert-butyldiphenylsilyl'l'-O-methyl uridine (1): To a solution of T-O- 
methyl uridine stirring at 0 C under positive pressure argon in anhydrous pyridine was 
added /m-butyldiphenylsilyl chloride (1.2 eq.) The reaction mixture was allowed to 
warm to rt and was maintained at rt for 18 hours at which time ethanol was added, 
pyridine removed in vacuo, and the reaction residue partitioned between dichloromethane 
and sat. aq. sodium bicarbonate. The organic layer was then dried over sodium sulfate. 
Flash chromatography using an ethyl acetate/hexanes gradient gave (1) as a white foam, 

5'-0'tert-butyldiphenylsifyl-2''0-methyl-2J''anhydro uridine (2): To a solution 
of (1) and DEAD (3.5 eq.) stirring at 0 C under argon in anhydrous THF was added 
triphenylphosphine (3.5 eq.). The reaction mixture was warmed to rt and stirred at rt 
under argon for 18 hours, after which THF was removed in vacuo. The crude reaction 
residue was partitioned between dichloromethane and sat. aq. sodium bicarbonate, the 
organics dried over sodium sulfate preceding flash chromatography. An etliyl 
acetate/hexanes gradient afforded (2) as an off white foam. 

5''O'tert-butyldiphenylsily!'3'~S'acetyl-2'-0'methyl uridine (3): Compound (2) 
was treated with thiolacetic acid in dioxane at 100 C for 18 hours while stirring in a 
stainless steel bomb. The reaction mixture was evaporated in vacuo then purified by flash 
silica gel chromatography to give (3) as a light yellow foam. 

5''0-dimethoxytrityU3'-S-acetyl-2'-0'methyl uridine (4): To a solution of (3) 
stirring at rt under positive pressure argon was added IM TBAF in THF buffered with 
acetic acid. The resulting clear, light yellow solution was stirred at rt for one hour, then 
THF removed in vacuo. Crude (4) was flash silica purified using an 
ethanol/dichloromethane gradient. The purified product was then co-evaporated with 
anhydrous pyridine, then dissolved in anhydrous pyridine. Dimethoxytrityl chloride was 
added to the reaction at rt and the resulting clear, reddish solution stirred at rt for 1 8 hours. 
Pyridine was removed in vacuo after quenching with ethanol, and the resulting crude foam 
partitioned between dichloromethane and sat. aq. sodium bicarbonate and the organics 
dried over sodium sulfate. Flash chromatography using an ethyl acetate/hexanes gradient 
ftimished pure (4). 
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5 '-O-dimethoxytrityl-3 '-deoxy-3 •-thio-2 '-O-methyl undine (5): Compound (4) was 
dissolved in 40% aq. methylamine in the presence of DTT. The reaction mixture was 
stirred at rt for one hour then evaporated in vacuo. Flash chromatography using an ethyl 
acetate/hexanes gradient gave (5) as an off wliite foam. 

5 '-O-dimethoxytrityl-3 '-deoxy-3 '-thio-3'-S-(2-Cyanoethyl-N,N.Diisopropylphos- 
phoramidite.2 '.O-methyl uridine (6): To a cooled (0 °C) solution of (5) and A^.iV-diiso- 
propylethylamine in dry CH2CI2 was added 2-cyanoethyl A^.^V-diisopropylchlorophos- 
phoramidite dropwise via syringe. The mixture was stirred at room temperature uotil all 
starting material was consumed (5 hr.) The reaction mixture was quenched with 
anhydrous ethanol and diluted with hexanes. Flash chromatography using an ethyl 
acetate/hexanes gradient provided pure (6). 

Diagno.stic u.se«8 

Nucleic acid molecules of this invention may be used as diagnostic tools to 
examine genetic drift and mutations within diseased cells or to detect the presence of 
specific RNAs in a cell. The close relationship between for example ribozyme activity 
and the structure of the target RNA allows the detection of mutations in any region of the 
molecule which alters the base-pairing and three-dimensional structure of the target RNA. 
By using multiple ribozymes described in this invention, one may map nucleotide changes - 
which are important to RNA structure and function in vitro, as well as in cells and tissues. 
Cleavage of target RNAs with ribozymes may be used to inhibit gene expression and 
define the role (essentially) of specified gene products in the progression of disease. In 
this manner, other genetic targets may be defmed as important mediators of the disease. 
These experiments will lead to better treatment of the disease progression by affording the 
possibility of combinational therapies (e.g., multiple nucleic acid molecules targeted to 
different genes, nucleic acid molecules coupled with known small molecule inhibitors, or 
intermittent treatment with combinations of nucleic acid molecules and/or other chemical 
or biological molecules). Other in vitro uses of nucleic acid molecules of this invention 
are well known in the art, and include detection of tiie presence of RNAs related to various 
conditions. Such RNA is detected by determining the presence of a cleavage product after 
treatment with for example, an enzymatic nucleic acid molecule using standaid 
methodology. 

In a specific example, ribozymes which can cleave only wild-type or mutant forms 
of the target RNA are used for the assay. The first ribozyme is used to identify wild-type 
RNA present in the sample and the second ribozyme will be used to identify mutant RNA 
in the sample. As reaction controls, synthetic substrates of both wild-type and mutant 
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RNA will be cleaved by both ribozyraes to demonstrate the relative ribozyme eflBciencics 
in the reactions and the absence of cleavage of 'the "non-targeted" RNA species. The 
cleavage products from the synthetic substrates will also serve to generate size markers for 
the analysis of wild-type and mutant RNAs in the sample population. Thus each analysis 
5 will require two ribozymes, two substrates and one unknown sample which will be 
combined into six reactions. The presence of cleavage products will be determined using 
an RNAse protection assay so that full-length and cleavage fragments of each RNA can be 
analyzed in one lane of a polyacrylamide gel It is not absolutely required to quantify the 
resuhs to gain insight into the expression of mutant RNAs and putative risk of the desired 

10 phenotypic changes in target cells. The expression of mRNA whose protein product is 
implicated in the development of a phenotype is adequate to establish risk. If probes of 
comparable specific activity are used for both transcripts, then a qualitative comparison of 
RNA levels will be adequate and will decrease the cost of the mitial diagnosis. Higher 
mutant form to wild-type ratios will be correlated with higher risk whether RNA levels are 

1 S compared qualitatively or quantitatively. 

Additional Uses 

Potential usefulness of sequence-specific enzymatic nucleic acid molecules of the , 
instant mvention might have many of the same applications for the study of RNA that 
DNA restriction endonucleases have for the study of DNA (Nathans et aL, 1975 Ann. Rev, :.: 
20 Biochem. 44:273). For example, the pattern of restriction fragments could be used to * 
establish sequence relationships between two related RNAs, and large RNAs could be^ 
specifically cleaved to fragments of a size more useful for study. The ability to engineer ■ 
sequence specificity of the ribozyme is ideal for cleav2ige of RNAs of unknown sequence. 

Other embodiments are within the following claims. 
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TABLE! 

Characteri stics of naturally occurring ribozy mes 

Group I Introns 

Size: -150 to >1000 nucleotides. 

Requires a U in the target sequence immediately 5' of the cleavage site. 
Binds 4-6 nucleotides at the 5;-side of the cleavage site. 

Reaction mechanism: attack by the 3'-OH of guanosine to generate cleavage products with 
3 *-0H and 5'-guanosine. 

Additional protein cofactors required in some cases to help folding and maintainancc of 
the active structure. 

Over 300 known members of this class. Found as an intervening sequence in 

T nrahymena thermophila rRNA, fungal mitochondria, chloroplasts, phage T4, blue-green 

algae, and others. 

Major structural features largely established through phylogenetic comparisons, 

mutagenesis, and biochemical studies [^'^]. 

Complete kinetic framework established for one ribozyme 

Studies of ribozyme folding and substrate docking underway [^,*,^]. 



^ Michel, Francois; Westhof, Eric. Slippery substrates. Nat. Struct. Biol. (1994), 1(1), 5.7. 

Lisacek, Frederique; Diaz, Yolande; Michel, Francois. Automatic identification of group I intron 
cores m genomic DNA sequences. J. Mol. Biol. (1994), 235(4), 1206-17. 

Herschlag, Daniel; Cech, Thomas R.. Catalysis of RNA cleavage by the Tetrahymena thermophila 
ribozyme. 1. Kinetic description of the reaction of an RNA substrate complementary to the active site 
Biochemistry (1990), 29(44), 10159-71. 

Herschlag, Daniel; Cech, Thomas R.. Catalysis of RNA cleavage by the Tetrahymena thermophila 
ribozyme. 2. Kinetic description of the reaction of an RNA substrate that forms a mismatch at the active 
site. Biochemistiy (1990), 29(44), 10172-80. 

^ Knitt, Deborah S.; Herschlag, Daniel. pH Dependencies of the Tetrahymena Ribozyme Reveal an 

Unconventional Origin of an Apparent pKa. Biochemistry (1996), 35(5), 1560-70. 

Bevilacqua, Philip C; Sugimoto, Naoki; Turner, Douglas H.. A mechanistic framework for the 
second step of splicmg catalyzed by the Tetrahymena ribozyme. Biochemistry (1996), 35(2), 648-58. 

Li, Yi; Bevilacqua, Philip C; Mathews, David; Turner, Douglas H.. Thermodynamic and activation 
parameters for binding of a pyrene-labeled substrate by the Tetrahymena ribozyme: docking is not diffusion- 
controlled and is driven by a favorable entropy change. Biochemistry (1995), 34(44), 14394-9. 

Banerjee, Alokc Raj; Turner, Douglas H.. The time dependence of chemical modification reveals 
slow steps m the foldmg of a group I ribozyme. Biochemistry (1995), 34(19), 6504-12. 
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Chemical modification investigation of important residues well established [*^"]. 

• The small (4-6 nt) binding site may make this ribozyme too non-specific for targeted RNA 
cleavage, hov^ever, the Tetrahymena group I intron has been used to repair a "defective" 
P-galactosidase message by the ligation of new p-galactosidase sequences onto the 

5 defective message [^^]. 

RNAse P RNA (Ml KNA) 

Size: --290 to 400 nucleotides. 

• RNA portion of a ubiquitous ribonucleoprotein enzyme. 
Cleaves tRNA precursors to form mature tRNA [^^]. 

1 0 • Reaction mechanism : possible attack by M^^-OH to generate cleavage products with 3 
OH and S '-phosphate. 

RNAse P is found throughout the prokaryotes and cukaiyotes. The RNA subunit has been 
sequenced from bacteria, yeast, rodents, and primates. 

Recruitment of endogenous RNAse P for therapeutic applications is possible through 
1 5 hybridization of an External Guide Sequence (EGS) to the target RNA ^] 

Important phosphate and T OH contacts recently identified [^^,'^] 



Zairinkar, Patrick P.; Williamson, James R.. The P9.I-P9.2 peripheral extension helps guide folding 
of die Tetrahymena ribozyrae. Nucleic Acids Res. (1996), 24(5). 854-8. 

Strobel, Scott A.; Cech, Thomas R.. Minor groove recognition of the conserved G.cntdot.U pair at 
the Tetrahymena ribozyme reaction site. Science (Washington, D. C.) (1995), 267(5198), 675-9. 

" Strobel, Scott A.; Cech, Thomas R.. Exocyclic Amine of the Conserved G.cntdot.U Pair at the 
Cleavage Site of the Tetrahymena Ribozyme Contributes to 5'-Splice Site Selection and Transition State 
Stabilization. Biochemistry (1996), 35(4), 1201-1 1. 

*^ Sullenger, Bruce A.; Cech, Thomas R.. Ribozyme-mediated repair of defective mRNA by targeted 
trans-splicing. Nature (London) ( 1 994), 371(6498), 619-22. 

" Robertson, H.D.; Altman, S.; Smith, J.D. J. Biol. Chem., 242, 5243^5251 (1972). 

Forster, Anthony C; Altman, Sidney. External guide sequences for an RNA enzyme. Science 
(Washington, D. C, 1883-) (1990), 249(4970), 783-6. 

Yuan, Y.; Hwang, E. S.; Altman, S. Targeted cleavage of mRNA by human RNasc P, Proc. Natl 
Acad. Sci. USA (1992) 89, 8006-10. 

Han-is, Michael E.; Pace, Norman R.. Identification of phosphates involved in catalysis by the 
ribozyme RNase P RNA. RNA (1995), 1(2), 210-18. 

Pan, Tao; Loria, Andrew; Zhong, Kun. Probing of tertiary interactions in RNA: 2'-hydroxyl-base 
contacts between the RNase P RNA and pre-tRNA. Proc. Natl. Acad. Sci. U. S. A. (1995) 92(26) 125 10- 
14. 
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Group II rntrnns 

Size: > 1 000 nucleotides. 

Trans cleavage of target RNAs recently demonstrated 
Sequence requirements not fully determined. 

Reaction mechanism: 2'-0H of an internal adenosine generates cleavage products with 3'- 
OH and a "lariat" RNA containing a 3'-5' and a 2'-5' branch point. 
Only natural ribozyme with demonstrated participation in DNA cleavage [V] in addition 
to RNA cleavage and ligation. 

Major stnictural features largely established through phylogenetic comparisons 
Important 2' OH contacts beginning to be identified [^^] 
Kinetic framework under development 

NeurosDora VS RNA 

• Size: ~I44 nucleotides. 

TRANS CLEAVAGE OF HAIRPIN TARGET RNAS RECENTLY DEMONSTRATED t"l. 

Sequence requirements not fiilly determined. 

Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate cleavage 
products witii 2',3'-cyclic phosphate and S'-OH ends. 



A^vin n'*** '^^f ' ^T"' a Kinetic Framework for Group II Intron Ribozvme 

Acdvity: Quantitation of Interdomain Binding and Reaction Rate. Biochemistry (1994). 359X 2716^^^ 

" Michels, William J. Jr.; Pyle, Anna Marie. Conversion of a Group II Intron into a New Muitinle 
Turnover Ribo^e that Selectively Cleaves Oligonucleotides: Elucidation ofR^t^M^Z^^'^a 
Stnicnire>Tuncnon Relationships. Biochemistiy (1995), 34(9), 2965-77. wecnanism and 



Zimmerly, Steven; Guo, Huatao; Eskes, Robert- Yan" Jian- Perlmnn Phiiir. q • t » a i 

\ Griffui, Edmund A., Jr.; Qin, Zhifeng; Michels, Williams J., Jr.; Pyle, Anna Marie Groun H intrnn 

(1995). m!^1%T^°^' ^'"^ " Anna. Rev. Biochem. 

Abramovitz, Dana L.; Friedman, Richard A.; Pyle. Anna Marie. Catalytic role of 2'-hvdroxvl 
groups withm a group II intron active site. Science (Washington. D. C.) (1996) 27K5254) J4i1TJ 

llHn ^1""'"' J" Py''^' Anna Marie. Two competing pathways for self- 

(f5S6T2560r3Vi9"!''°"'^ ' " P^-'i- J BioL 

" Guo Hans C. T.; Collins, Richard A.. Efficient trans-cleavage of a stem-loop RNA substrate hv « 
ribozyme derived from Neurospora VS RNA. EMBO J. (1995), 14(2), 368-76 ^ 
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• Binding sites and structural requirements not fiilly determined. 

• Only 1 known member of this class. Found in Neurospora VS RNA. 

Hammerhead Ribozvme 
(see text for references) 
5 • Size: -13 to 40 nucleotides. 

Requires the target sequence UH immediately 5' of the cleavage site. 

• Binds a variable number nucleotides on both sides of the cleavage site. 
Reaction mechanism: attack by 2'-0H 5' to the scissile bond to generate cleavage 
products with 2\3*-cyclic phosphate and 5'-OH ends. 

10 • 14 known members of this class. Found in a number of plant pathogens (virusoids) that 
use RNA as the infectious agent 

Essential structural features largely defined, including 2 crystal structures t^^] 
Minimal ligation activity demonstrated (for engineering through in vitro selection) [^^] 

• Complete kinetic framework established for two or more ribozymes ^\ 

IS • Chemical modification investigation of important residues well established ^\ 

Hairpin Ribozvme 

• Size: -50 nucleotides. 

• Requires the target sequence GUC immediately 3* of the cleavage site. 

Binds 4-6 nucleotides at the 5'-side of the cleavage site and a variable number to the 3*- J 
20 side of the cleavage site. 

Reaction mechanism: attack by 2'-0H 5' to the scissile bond to generate cleavage : 
products with 2',3'-cyclic phosphate and 5'-OH ends. 



Scott, W.G., Finch, J.T., Aaron,K. The crystal structure of an all RNA hammerhead 
ribozyme:Aproposed mechanism for RNA catalytic cleavage. Cell, (1995), 81, 991-1002. 

" McKay, Structure and function of the hammerhead ribozyme: an unfmished story. RNA, (1996) 2 
395-403. 



Long, D., Uhlenbeck, O., Hertel, K. Ligation witfi hammeihead ribozymes. US Patent No. 
5,633,133. 

29 

Hertel, KJ., Herschlag, D., Uhlenbeck, O. A kinetic and thermodynamic framework for the 
hammerhead ribozyme reaction. Biochemistry, (1994) 33. 3374-3385.Beigelman, L., et ai, Chemical 
modifications of hammerhead ribozymes. J. Biol. Chem., (1995) 270, 25702-25708. 

Beigelman, L., et ai. Chemical modifications of hammerhead ribozymes. J. Biol. Chem., (1995) 
270, 25702-25708. 
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• 31™™,n«n6e„of«,i,.,«. Found ,„*,«p,«p„H„,,„,3«.„„R„^,„f,^ 

Essential structural features lai^ely defined f^' 33 34j 

engineenng through /«v//>-o selection [3^J , 
Complete kinetic framework established for one ribozyme [^«J 
• Chemical modification investigation of important residues begun [^Vj. 

Hepatitis Delta Virus mnv) Viu^^^^ 
'0 • Size: -60 nucleotides. 

Trans cleavage of target RNAs demonstrated [^'] 
• «^'''"S«*--^^'™cturalrec,^^^^^^^ 

^'"'^^"'^^^-^"■-'^ ^^•''^•"'-^econtainsapseudoknotstructuren 

M.. EssenSul^'^l^^^^^^^^^ M.; Butcher, Samuel E, Burke John 

0993), 12(6). 2567-73. "'^"'''^ of the hairpin ribozyme. EMBO^^^^^ 

misiuaclttd subsoats. Oenes Dev (1593), ™ ) 130 , "l««"n. "mtttion. and analysis of 

36 ^'^ reactions. Genes Dev. (1992). 6(1). 129-34 

Hegg,LisaA.;Fedor,MarthaJ Kinetic<ianHTK»r™ J 
Haupin Ribozymes. BiochemisUy (1995). sSl SJir " Intermolecular Catalysis by 

MiS^i^^s ""r^ ^'«'»«. s ■ 

c.ea.a.o:^S^32~C.,t^-^^^^^ 
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Reaction mechanism: attack by 2'-0H 5' to the scissilc bond to generate cleavage 

products with 2',3'-cyclic phosphate and 5'-OH ends. 

Only 2 known members of this class. Found in human HDV. 

Circular form of HDV is active and shows increased nuclease stability [^'] 



Puttaraju, M.; Pen'otta, Anne T.; Been, Michael D.. A circular trans-acting hepatitis delta virus 
ribozyme. Nucleic Acids Res. (1993), 21(18), 4253-8. 
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Table II: 2.5 Mmol RNA Synthesis Cycle 

Reagent Equivalents Amount Wait 

Time* 



Phosphoramidites 


6.5 


163 nL 


2.5 


i'-Ethyl Tetrazole 


23.8 


238 nL 


2.5 


Acetic Anhydride 


100 


233 nL 


5 sec 


A'-Methyl Imidazole 


186 


233 ML 


5 sec 


TCA 


83.2 


1.73 mL 


21 sec 


Iodine 


8.0 


1.18 mL 


45 sec 


Acetonitrile 


NA 


6.67 mL 


NA 
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Table IV. Hammerhead Ribozyme and Target SequencesFor £strogen Receptor 



rOS 


KZ 


Seq. ID. 
No. 


Substrate 


Seq. ID. 
No. 


22 


UUGGCUUA CUGAUGAG X CGAA ACAUCACU 


1 


AGTGATGT T TAAGCCAA 


1246 


23 


AUUGGCUU CUGAUGAG X CGAA AACAUCAC 


2 


GTGATGTT T AAGCCAAT 


1247 


24 


CAUUGGCU CUGAUGAG X CGAA AAACAUCA 


3 


TG ATGTTT A AGCCAATG 


1248 


34 


CUUGCCCU CUGAUGAG X CGAA ACAUUGGC 


4 


GCCAATGT C AGGGCAAG 


1249 


SI 


CGGCCAGG CUGAUGAG X CGAA ACUGUUGC 


5 


GCAACAGT C CCTGGCCG 


1250 


61 


UGCUGGAG CUGAUGAG X CGAA ACGGCCAG 


6 


CTGGCCGT C CTCCAGCA 


1251 


64 


AGGUGCUG CUGAUGAG X CGAA AGG ACGGC 


7 


GCCGTCCT C CAGCACCT 


1252 


73 


GCAUUACA CUGAUGAG X CGAA AGGUGCUG 


8 


CAGCACCT T TGTAATGC 


1253 


74 


UGCAUUAC CUGAUGAG X CGAA AAGGUGCU 


9 


AGC ACCTT T GTAATGC A 


1254 


77 


AUAUGCAU CUGAUGAG X CGAA ACAAAGGU 


10 


ACCTTTGT A ATGCATAT 


1255 


84 


CGAGCUCA CUGAUGAG X CGAA AUGCAUUA 


11 


TAATGCAT A TGAGCTCG 


1256 


91 


GGUCUCCC CUGAUGAG X CGAA AGCUCAUA 


12 


TATGAGCT C GGGAGACC 


1257 


103 


ACUUUAAG CUGAUGAG X CGAA ACUGGUCU 


13 


AGACCAGT A CTTAAAGT 


1258 


106 


CCAACUUU CUGAUGAG X CGAA AGUACUGG 


14 


CC AGTACT T AAAGTTGG 


1259 


107 


UCCAACUU CUGAUGAG X CGAA AAGUACUG 


15 


CAGTACTT A AAGTTGGA 


1260 


112 


GGGCCUCC CUGAUGAG X CGAA ACUUUAAG 


16 


CTTAAAGT T GGAGGCCC 


1261 


148 


CCAGGACG CUGAUGAG X CGAA ACGCCCUC 


17 


GAGGGCGT T CGTCCFGG 


1262 


149 


CCCAGGAC CUGAUGAG X CGAA AACGCCCU 


18 


AGGGCGTT C GTCCTGGG 


1263 


152 


GCUCCCAG CUGAUGAG X CGAA ACGAACGC 


19 


GCGTTCGT C CTGGGAGC 


1264 


167 


GACGGAGC CUGAUGAG X CGAA AGUGCAGC 


20 


GCTGCACT T GCTCCGTC 


1265 


171 


ACCCGACG CUGAUGAG X CGAA AGCAAGUG 


21 


CACTTGCT C CGTCGGGT 


1266 


175 


GGCGACCC CUGAUGAG X CGAA ACGGAGCA 


22 


TGCTCCGT C GGGTCGCC 


1267 


180 


AAGCCGGC CUGAUGAG X CGAA ACCCGACG 


23 


CGTCGGGT C GCCGGCTT 


1268 


188 


GUCCGGUG CUGAUGAG X CGAA AGCCGGCG 


24 


CGCCGGCT T CACCGGAC 


1269 


189 


GGUCCGGU CUGAUGAG X CGAA AAGCCGGC 


25 


GCCGGCTT C ACCGGACC 


1270 


205 


UGCCCCGG CUGAUGAG X CGAA AGCCUGCG 


26 


CGCAGGCT C CCGGGGCA 


1271 


231 


CGACACGC CUGAUGAG X CGAA AGCUCUGG 


27 


CCAGAGC'I C GCGTGTCG 


1272 



10 



15 



20 



25 



30 
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^JS j uui.t>cucu CUGAUGAG X CGAA ACACGCG) 


\ 28 


ICGCGTGTCGGCGGGA< 


1273 


UUAGAGut. CUGAUGAG X CGAA ACGCAGC( 
263 CGAGGUUa UUUAUGAG X CGAA AGGCGACC 


3 29 


CGCTGCGTCGCCTCTA/ 


^ 1274 


5 30" 


CGTCGCCTCTAACCTCG 


1275 


265 CCCGAGGU uuuAUGAG X CGAA AGAGGCGi^ 


31 


TCGCCTCTAACCTCGGG 


1276 


- ;u CACAGClJC uuuaugau X CGAA AGGUUAGA 


32 


TCI AACCT C GGGCTGTG 


1277 


28 J UGUAAAAAUUUAUGAGXCGAAAGCACAGC 


33 


UCTGTGCTCmTTCCA 


1278 


2tiJ LCU^Aaa uuuauGAG X CGAA AGAGCACA 


34 


IGTGCTCnriTCCAGG 


1279 


284 ACCUkmkjm CUGAUGAG X CGAA AAGAGCAC 


35 


GlGCrCTTT'lTCCAGGT 


1280 


28^ LACCUGOA CUUAUGAG X CGAA AAAGAGCA 


36 


J GCTCTTT r TCCAGGTG " 


1281 


286 LCACCUoo CUGAUGAG X CGAA AAAAGAGC 


37 


GCTCTTTT \ CCAGGTGG 


1282 


28/ ^^t^ACCUCi CUGAUGAG X CGAA AAAAAGAG 


38 


cic rrnr c caggtggc ■ 


1283 


JU4 GOCUi^AuA CUGAUGAG X CGAA ACCGGCGG"" 


39 


CCGCCGGT 1 TCTGAGCC 


1284 


JU^ AGGL.UCAU CUGAUGAG X CGAA AACCGGCG " 


40 


cgccggtt r ctgagcct 


1285 


306 AAGGCUCA CUGAUGAG X CGAA AAACCGGC" 


41 


GCCGGITT C I G AGCCTT 


1286 


314 CAGOgcag CUGAUGAG X CGAA AGGCUCAG" 


42 


C 1 UAGCCT T CTGCCCTG 


1287 


:>iD\ UCAUUGCA CUGAUGAG X CGAA AAGGCUCA " 


43 


r i GAGCCIT C TGCCCTGC 


1288 


335 AGGOUGCA CUGAUGAG X CGAA ACCGUGUC " 


44 


UACACGGT C TGCACCCt " 


1289 


J j UUUOUGUG CUGAUGAG X CGAA AGGGUCAlT 


45 


AlUACCCTCCACACCAA ' 


1290 


389 ^<-AUCCCA CUGAUGAG X CGAA AUGCUUUG 

402 UGAUGCAG CllCiAWCAn V m a a a C^r^r^ryry A T , ■ 


46 


CAAAGCAT C TGGGATGG" " 


1291 


1 wrvji-i.wvj\-.^vj \^<Ju/\uuAU A CuAA AGGGCCAU 


47 


AlGGCCCTACTGCATCA 


1292 


fuy UUGGAUCU CUGAUGAG X CGAA AUGCAGUA " 


48 


tactgcatcagatccaa 


1293 


'H'* Uu«-t,».uuu LUUAUGAG X CUAA AUCUOAUG " " 


49 


CAICAGATCCAAGGGAA " 


1294 


443 GAGCUUCU CUGAUGAG X CGAA ACGOUUCA " 


50 


I GAACCGT C CGCAGCTC" " 


1295 


433 GOGAUCUU CUGAUGAG X CGAA AGCUGCGG" " 


51 


CCUCAGCT C AAGATCCC 


1296 


UCCAGGGG CUGAUGAG X CGAA AUCUUGAG " 
488 UGUCCagg CUGAUGAG X CGAA ACACCUCG " 


52 


C 1 CAAGAT C CCCCTGGA 


1297 


m:> GGUAGuug CUGAUGAG X CGAA ACACGGCG " 


■"53 
"■"54 


CG AGGTGT A CC TGr. A r a ~ 
CGCCGTGTACAACTACC " 


1298 
1299 


321 CCUCGGUU CUGAUGAG X CGAA AGUUOUAC " 
33y UGAACUCG CUGAUGAG X CGAA AGGCGGCG " 


55 


GIACAACTACCCCGAGG 


1300 


""56 


CUCCGCCT A CG AGTTCA 


1301 


M5 CGGCGUUU CUGAUGAG X CGAA ACUCGUAG 


57 . 


c 1 ACGAGT T CAACGCCG 


1302 


346 GCGGCGUU CUGAUGAG X CGAA AACUCGUA 


58 


lACUAGTTCAACGCCGC 


1303 
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576 


UGACCGUA CUGAUGAG X CGAA ACCUGCGC 


59 


GCGCAGGT C TACGGTCA 


1304 


578 


UCUGACCG CUGAUGAG X CGAA AGACCUGC 


60* 


GCAGGTCT A CGGTCAGA 


1305 


583 


GCCGGUCU CUGAUGAG X CGAA ACCGUAGA 


61 


TCTACGGT C AGACCGGC 


1306 


594 


CCGUAGGG CUGAUGAG X CGAA AGGCCGGU 


62 


ACCGGCCT C CCCTACGG 


1307 


599 


CGGGGCCG CUGAUGAG X CGAA AGGGGAGG 


63 


CCTCCCCT A CGGCCCCG 


1308 


611 


CAGCCUCA CUGAUGAG X CGAA ACCCGGGG 


64 ' 


CCCCGGGT C TGAGGCTG 


1309 


626 


UGGAGCCG CUGAUGAG X CGAA ACGCCGCA 


65 ' 


TGCGGCGT T CGGCTCCA 


1310 


627 


UUGGAGCC CUGAUGAG X CGAA AACGCCGC 


66 ' 


GCGGCGTT C GGCTCCAA 


1311 


632 


GGCCGUUG CUGAUGAG X CGAA AGCCGAAC 


67 


GTTCGGCT C CAACGGCC 


1312 


649 


UGGGGGGA CUGAUGAG X CGAA ACCCCCCA 


68 


TGGGGGGT T TCCCCCCA 


1313 


650 


GUGGGGGG CUGAUGAG X CGAA AACGCCGC 


69 


GGGGGGTT T CCCCCCAC 


1314 


651 


AGUGGGGG CUGAUGAG X CGAA AAACCCCC 


70 


GGGGGTIT C CCCCCACT 


1315 


"660" 


ACGCUGUU CUGAUGAG X CGAA AGUGGGGG 


71 


CCCCCACT C AACAGCGT 


' 131^ 


671 


GGCUCGGA CUGAUGAG X CGAA ACACGCUG 


72 


CAGCGTGT C TCCGAGCC 


1317 


673 


CGGGCUCG CUGAUGAG X CGAA AGACACGC 


73 


GCGTGTCT C CGAGCCCG 


1318 


690 


GGGUGCAG CUGAUGAG X CGAA AGCAUCAG 


74 


CTGATGCT A CTGCACCC 


1319 


713 


GGAAAGGC CUGAUGAG X CGAA ACAGCUGC 


75 


GCAGCTGT C GCCTTTCC 


1320 


718 


CUGCAGGA CUGAUGAG X CGAA AGGCGACA 


76 


TGTCGCCT T TCCTGCAG 


1321 


719 


GCUGCAGG CUGAUGAG X CGAA AAGGCGAC 


77 


GTCGCCTT T CCTGCAGC 


1322 


720 


GGCUGCAG CUGAUGAG X CGAA AAAGGCGA 


78 


TCGCCTTT C CTGCAGCC 


1323 


749 


CCAGGUAG CUGAUGAG X CGAA AGGGCACC 


79 


GGTGCCCT A CTACCTGG 


1324 


752 


UCUCCAGG CUGAUGAG X CGAA AGU AGGGC 


80 


GCCCTACT A CCTGGAGA 


1325 


776 


GCACCGUG CUGAUGAG X CGAA AGCCGCUG 


81 • 


CAGCGGCT A CACGGTGC 


1326 


806 


GCCUGUAG CUGAUGAG X CGAA AUGCCGGC 


82 


GCCGGCAl r CTACAGGC 


1327 


807 


GGCCUGU A CUGAUGAG X CGAA AAUGCCGG 


83 ' 


CCGGCATT C TACAGGCC 


1328 


o\jy 


UUvjOCCUU UUOAUOAO a coaa auaauocc 


84 


GGCATTCT A CAGGCCAA 


1329 


820 


AUUAUCUG CUGAUGAG X CGAA AUUUGGCC 


85 


GGCCAAAT T CAGATAAT 


1330 


821 


GAUUAUCU CUGAUGAG X CGAA AAUUUGGC 


86 


GCCAAATT C AGATAATC 


1331 


826 


GCGUCGAU CUGAUGAG X CGAA AUCUGAAU 


87 


ATTCAGA I A ATCGACGC 


1332 


829 


CUGGCGUC CUGAUGAG X CGAA AUUAUCUG 


88 


CAGATAAT C GACGCCAG 


1333 


854 


UACUGGCC CUGAUGAG X CGAA AUCUUUCU 


89 


AGAAAG AT T GGCCAGTA 


1334 
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|862 


GUCAUUGG CUGAUGAG X CGAA ACUGGCCA 


90 


TGGCCAGT A CCAATGAC 


1335 


S80 


CAUAGCCA CUGAUGAG X CGAA ACUUCGCU 


91 


AGGGAAGT A TGGCTATG 


1336 


886 


AGAUUCCA CUGAUGAG X CGAA AGCCAUAC 


92 


GTATGGCT A TGGAATCT 


1337 


893 


CCUUGGCA CUGAUGAG X CGAA AUUCCAUA 


93 


TATGGAAT C TGCCAAGG 


1338 


907 


ACAGUAGC CUGAUGAG X CGAA AGUCUCCU 


94 


AGGAGACT C GCTACTGT 


1339 


911 


CUGCACAG CUGAUGAG X CGAA AGCGAGUC 


95 


GACTCGCT A CTGTGCAG 


1340 


932 


CUGAAGCA CUGAUGAG X CGAA AGUCAUUG 


96 


CAATGACT A TGCTTCAG 


1341 


937 


GUAGCCUG CUGAUGAG X CGAA AGCAUAGU 


97 


ACTATGCTT CAGGCTAC 


1342 


938 


GGUAGCCU CUGAUGAG X CGAA AAGCAUAG 


98 


CTATGCTT C AGGCTACC 


1343 


*94T" 


CAUAAUGG CUGAUGAG X CGAA AGCCUGAA 


99 


TTCAGGCT A CCATTATG 


1344 


949 


GACUCCAU CUGAUGAG X CGAA AUGGUAGC 


100 


GCTACCAT T ATGGAGTC 


1345 


950 


AGACUCCA CUGAUGAG X CGAA AAUGGUAG 


lot 


CTACCAIT A TGG AGTCT 


1346 


957 


CAGGACCA CUGAUGAG X CGAA ACUCCAUA 


102 


TATGGAGT C TGGTCCTG 


1347 


"962 


CCUCACAG CUGAUGAG X CGAA ACCAGACU 


103 


AGTCTGGT C CTGTGAGG 


1348 


98r 


UCUUGAAG CUGAUGAG X CGAA AGGCCUUG 


104 


CAAGGCCT T CTTCAAG A 


1349 


'W 


CUCUUGAA CUGAUGAG X CGAA AAGGCCUU 


105 


AAGGCCTT C TTC AAG AG 


1350 


986 


UUCUCUUG CUGAUGAG X CGAA AGAAGGCC" 


106 


GGCCTTCT T CAAGAGAA 


1351 


987 


CUUCUCUU CUGAUGAG X CGAA AAGAAGGC 


107 


GCCTTCTT C AAGAG AAG 


1352 


997 


UCCUUGAA CUGAUGAG X CGAA ACUUCUCU 


108 


AGAGAAGT A TTCAAGGA 


1353 


999 


UGUCCUUG CUGAUGAG X CGAA AUACUUCU 


109 


AG AAGTAT T CAAGGACA 


1354 


1000 


AUGUCCUU CUGAUGAG X CGAA AAUACUUC 


no 


G AAGTATT C AAGGACAT 


1355 


1009 


AUAGUCGU CUGAUGAG X CGAA AUGUCCUU 


HI 


AAGGACAT A ACGACTAT 


1356 


1016 


GACACAUA CUGAUGAG X CGAA AGUCGUUA 


112 


TAACGACT A TATGTGTC 


1357 


1018 


UGGACACA CUGAUGAG X CGAA AUAGUCGU 


113 


ACGACTAT A TGTGTCCA 


1358 


1024 


GGUGGCUG CUGAUGAG X CGAA ACACAUAU 


114 


ATATGTGT C CAGCCACC 


1359 


in^7 
1U4 / 


UUUUUAUC CUGAUGAG X CGAA AUGGUGCA 


115 


TGCACCAT T G ATAAAAA 


1360 


1051 


CCUGUUUU CUGAUGAG X CGAA AUCAAUGG 


116 


CCATTGAT A AAAACAGG 


1361 


1086 


CAUIKJGCG CUGAUGAG X CGAA AGCCGGCA 


117 


TGCCGGCT C CGCAAATG 


1362 


1097 


CCACUUCG CUGAUGAG X CGAA AGCAUUUG " 


118 


CAAATGCT A CGAAGTGG 


1363 


1125 


UCUUUUCG CUGAUGAG X CGAA AUCCCACC 


119 


GGTGGGAT A CGAAAAGA 


1364 


1154 


UGUGUUUC CUGAUGAG X CGAA ACAUUCUC 


120 


GAGAATGT T GAAACACA 


1365 
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1205 


CUCCAGCA CUGAUGAG X CGAA ACCCCACU 


121 


AGTGGGGT C TGCTGGAG 


1366 


1233 


CUUGGCCA CUGAUGAG X CGAA AGGUUGGC 


122 


GCCAACCT T TGGCCAAG 


1367 


1234 


GCUUGGCC CUGAUGAG X CGAA AAGGUUGG 


123 


CCAACCTT T GGCCAAGC 


1368 


1248 


UUGAUCAU CUGAUGAG X CGAA AGCGGGCU 


124 


AGCCCGCT C ATGATCAA 


1369 


1254 


GAGCGUUU CUGAUGAG X CGAA AUCAUOAG 


125 


CTCATGAT C AAACGCTC 


1370 


1262 


UCUUCUUA QJGAUGAG X CGAA AGCGUUUG 


126 


CAAACGCT C TAAGAAGA 


1371 


1264 


GUUCUUCU CUGAUGAG X CGAA AGAGCGUU 


127 


AACGCTCT A AGAAGAAC 


1372 


1283 


UCAGGGAC CUGAUGAG X CGAA AGGCCAGG 


128 


CCTGGCCT T GTCCCTG A 


1373 


1286 


CCGUCAGG CUGAUGAG X CGAA ACAAGGCC 


129 


GGCCTTGT C CCTGACGG 


1374 


1308 


AAGGCACU CUGAUGAG X CGAA ACCAUCUG 


130 


CAGATGGT C AGTGCCTT 


1375 


1316 


CAUCCAAC CUGAUGAG X CGAA AGGCACUG 


131 


CAGTGCCT T GTTGGATG 


1376 


1319 


CAGCAUCC CUGAUGAG X CGAA ACAAGGCA 


132 


TGCCTTGT T GGATGCIG 


1377 


1338 


GAAUAGAG CUGAUGAG X CGAA AUGGGGGG 


133 


CCCCCCAT A CTCTATTC 


1378 


1341 


UCGGAAUA CUGAUGAG X CGAA AGUAUGGG 


134 


CCCATACT C TATTCCGA 


1379 


1343 


ACUCGGAA CUGAUGAG X CGAA AGAGUAUG 


135 


CATACTCT A TTCCGAGT 


1380 


1345 


AUACUCGG CUGAUGAG X CGAA AUAGAGUA 


136 


TACTCTAT T CCGAGTAT 


1381 


1346 


CAUACUCG CUGAUGAG X CGAA AAU AGAGU 


137 


ACTCTATT C CGAGTATG 


1382 


1352 


UAGGAUCA CUGAUGAG X CGAA ACUCGGAA 


138 


TTCCGAGT A TGATCCI A 


1383 


1357 


UCUGGUAG CUGAUGAG X CGAA AUCAUACU 


139 


AGTATGAT C CTACCAGA 


1384 


1360 


GGGUCUGG CUGAUGAG X CGAA AGGAUCAU 


140 


ATGATCCT A CCAGACCC 


1385 


1370 


CUUCACUG CUGAUGAG X CGAA AGGGUCUG 


Ml 


CAGACCCT T CAGTGAAG 


1386 


1371 


GCUUCACU CUGAUGAG X CGAA AAGGGUCU 


142 


AGACCCTT C AGTGAAGC 


1387 


1381 


CAUCAUCG CUGAUGAG X CGAA AGCUUCAC 


143^ 


GTGAAGCT T CGATGATG 


1388 


1382 


CCAUCAUC CUGAUGAG X CGAA AAGCUUCA 


144: 


TGAAGCTT C GA TGATGG 


1389 


1394 


UGGUCAGU CUGAUGAG X CGAA AGCCCAUC 


145' 


GATGGGCT T ACTGACCA 


1390 


HQS 


I II inni iPAfi PI If vAi'inAr: y cciaa a a^pppai i 

U \J \J\J \J y^r^yj \^\J\Ji\\JKj/\\3 W \^VJ/\/\ /vr\VJv^^^/\U 


140. 


A 1 OUuC 1 1 A CI \j ACL AA 


1391 


1425 


AUCAUGUG CUGAUGAG X CGAA ACCAGCUC 


147. 


GAGCTGGT T CACATGAT 


1392 


1426 


GAUCAUGU CUGAUGAG X CGAA AACCAGCU 


148 


AGCTGGTT C ACATGATC 


1393 


1434 


GCCCAGUU CUGAUGAG X CGAA AUCAUGUG 


149 


CACATGAT C AACFCGGC 


1394 


1460 


AAUCCACA CUGAUGAG X CGAA AGCCUGGC 


150 


GCCAGGCT T TGTGGATT 


1395 


1461 


AAAUCCAC CUGAUGAG X CGAA AAGCCUGG 


151 


CCAGGCTT T GTGG ATTl 


1396 
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.•.«o u/vuuuucA LUGAUGAG X CGAA AUCCACA/ 


^ 1 152 


ITGTGGAT T TGACCCTC 


: 1397 


1469 OOAUGGUC CUGAUGAG X CGAA AAUCCAO 


\ 1 153 


IGTGGATTTGACCCTCC 


1398 


1476 uuAUCAUG CUGAUGAG X CGAA AGG'GvCA/ 


^ 154 


ITGACCCJ' C CATGATCA 


1399 


1 1483 OUGUALX;U CUGAUGAG X CGAA AUCAUGOA 
"1488 AUAAUGUO CUGAUGAG XCUAAACCUQAUC 
1494 CAUUCUAG CUGAUGAG X CGAA AGGUGGAC 


I 1 155 
i 156" 


JCCATGAT c aggtccac 


1400 




UATCAGGl CCACCTTCT" 


UOI 




UlCCACCTTCTAGAATG 


1402 


1495 ACAUUt-UA CUGAUGAG X CGAA AAGGUGGA 


158 


TCCACCTT C l AGAATGT 


1403 


1497 OCACAUUC CUGAUGAG X CGAA AGAAGGUG 


159 


CACCTTCT A GAATGTGC " 


1404 


1512 AOGAUCUC CUGAUGAG X CGAA AGCCAGGC 


160 


GCCTGGCT A GAGATCCT 


1405 


1518 AUCAUCAG CUGAUGAG X CGAA AUCUCUAG 


161 


CTAGAGAl C CTGATGAT 


1406 


1 527 AUUAUACC CUGAUGAG X CGAA AUCAUCAG 


162 


CTGATGAl 1 GGTCTCGT 


1407 


1 53 1 .^CAGACGA CUGAUGAG X CGAA ACCAAUCA 


163 


1 UATTGGT C I CGTCTGG 


1408 1 


1533 CG(,(_AGAC CUGAUGAG X CGAA AGACCAAU 


164 


A JTGGTCT C GTCTGGCG 


1409 


1536 UAUCGCUA CUGAUGAG X CGAA ACGAGACC 


165 


GGTCTCGT C TGGCGCTC 


1410 


1544 ucuuuAUG CUGAUGAG X CGAA AGCGCCAGi 


166 


CI GGCGCT C C ATGGAGC 


1411 


1566 ^AAAUAU CUGAUGAG X CGAA AGCUUdAcj 


167 


G TGAAGCT A CTGTTTGC 


1412 


1571 UAGGAULA CUGAUGAG X CGAA ACAGUAGC] 


168 


UCTACTGT T TGCTCCTA 


1413 


1572 uuAGUAOC CUGAUGAG X CGAA AACAGUAG] 


169 


Ci ACTGTT T GCTCCTAA" 


1414 


1576 .^AAUUUAG CUGAUGAG X CGAA AGCAAAC/Tf 


170 


1 GriTGCT c ctaacttg ■ " 


1415 


1579 OAGCAAGU CUGAUGAG X CGAA AGGAGCAAT 


171 


11 GCTCCT A ACTTGCTC " 


1416 


1583 CCAAOAUC CUGAUGAG X CGAA AGUUAGGAT 


172 


tcctaacttgctcttgg 


1417 


1 587 ^uuucCAA CUGAUGAG X CGAA AGCAAGUUT 


173 


AACTTGCT c ttggacag " 


1418 


1 589 UCCUUUCC CUGAUGAG X CGAA AGAGCAAG 


174 


cn GCTCT T ggacagga ■ ■ 


1419 


1614 AUUUCCUC CUGAUGAG X CGAA ACACAUUU 


175 


aaatgtgt a g agggcat ■ " 


1420 


1632 AUOUCGAA CUGAUGAG X CGAA AUCUCCAcf 
1634 GCAUGIJPn PnoArmAr' v r^r^ a a "Zr^ * wr^w,^^ • t- 


176 


U i ggagat c ttcgacat 


1421 


v^v./\u^Ju^^^J L.UUAUUAU X CGAA AGAUCUCC 


177 


GG AG ATCT T CGACATGC " 


1422 


1635 ACiCAUGUC CUGAUGAG X CGAA AAG AUCUC " 


178 " 


gagatcttcgacatgct ■ 


1423 


1651 AUAUUAUG CUGAUGAG X CGAA AGCCAGCAT 


179 


i CiC l GGCT A CATCA rCT " 


1424 


1655 ACCGAGAU CUGAUGAG X CGAA AUGUAGCC 


180 


ugctacat c atctcggt 


1425 


1658 ^^AAi^uuA CUGAUGAG X CGAA AUGAUGUaT 


181 


lACATCATCTCGGTTCC 


1426 


1660 OtOG/VACC CUGAUGAG X CGAA AGAUGADgT" 


182 


catcatci c ggttccgc " 


1427 
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5 



10 



20 



30 



1664 


UCAUGCGG CUGAUGAG X CGAA ACCGAGAU 


183 


ATCTCGGT T CCGCATGA 


1428 


1665 


AUCAUGCG CUGAUGAG X CGAA AACCGAGA 


184 


TCTCGGTT C CGCATGAT 


1429 


1678 


UCCCUGCA CUGAUGAG X CGAA AUUCAUCA 


185 


TGATGAAT C TGCAGGGA 


1430 


1694 


GGCACACA CUGAUGAG X CGAA ACUCCUCU 


186 


AGAGGAGT T TGTGTGCC 


1431 


1695 


AGGCACAC CUGAUGAG X CGAA AACUCCUC 


187 


GAGGAG n' T GTGTGCCT 


1432 


1704 


AUAGAUUU CUGAUGAG X CGAA AGGCACAC 


188 


GTGTGCCT C AAATCTAT 


1433 


1709 


AAAUAAUA CUGAUGAG X CGAA AUUUGAGG 


189 


CCTCAAAT C TATTATTT 


1434 


1711 


CAAAAUAA CUGAUGAG X CGAA AGAUUUGA 


190 


TC AAATCT A TTATTTTG 


1435 


1713 


AGCAAAAU CUGAUGAG X CGAA AUAGAUUU 


191 


AAATCTAT T ATTTTGCT 


1436 


1714 


AAGCAAAA CUGAUGAG X CGAA AAUAGAUU 


192 


AATCTATT A TTTTGCTT 


1437 


1716 


UUAAGCAA CUGAUGAG X CGAA AUAAUAGA 


193 


TCTATTAT T TTGCTTAA 


1438 


1717 


AUUAAGCA CUGAUGAG X CGAA AAUAAUAG 


194 


CTATTATT T TGCTI AAT 


1439 


1718 


AAUUAAGC CUGAUGAG X CGAA AAAUAAUA 


195 


TATTATTT T GCTTAATT 


1440 


1722 


CCAGAAUU CUGAUGAG X CGAA AGCAAAAU 


196 


ATTTTGCT T AATTCTGG 


1441 


1723 


UCCAGAAU CUGAUGAG X CGAA AAGCAAAA 


197 


TTTTGCTT A ATTCTGGA 


1442 


1726 


CACUCCAG CUGAUGAG X CGAA AUUAAGCA 


198 


TGCTTAAT T CTGGAGTG 


1443 


1727 


ACACUCCA CUGAUGAG X CGAA AAUUAAGC 


199 


GCTTAATT C TGGAGTGT 


1444 


1736 


GAAAUGUG CUGAUGAG X CGAA ACACUCCA 


200 


TGGAGTGT A CACATTTC 


1445 


1742 


UGGACAGA CUGAUGAG X CGAA AUGUGUAC 


201 


GTACACAT T TCTGTCCA 


1446 


1743 


CUGGACAG CUGAUGAG X CGAA AAUGUGUA 


202 


TACACATT T CTGTCCAG 


1447 


1744 


GCUGGACA CUGAUGAG X CGAA AAAUGUGU 


203 


ACACATTT C TGTCCAGC 


1448 


1748 

L, 


GGGUGCUG CUGAUGAG X CGAA ACAGAAAU 


204 


ATTTCTGT C CAGCACCC 


1449 




CUUCCAGA CUGAUGAG X CGAA ACUUCAGG 


205 


CCTGAAGT C TCTGGAAG 


1450 


1765 


CUCUUCCA CUGAUGAG X CGAA AGACUUCA 


206 


TGAAGTCT C TGGAAGAG 


1451 


1783 


UCGGUGGA CUGAUGAG X CGAA AUGGUCCU 


207 


AGGACCAT A TCCACCGA 


1452 


1785 


ACUCGGUG CUGAUGAG X CGAA AUAUGGUC 


208' 


GACCATAT C CACCGAGT 


1453 


1794 


UUGUCCAG CUGAUGAG X CGAA ACUCGGUG 


209 


CACCG AGT C CTGGACAA 


1454 


1806 


GUGUCUGU CUGAUGAG X CGAA AUCUUGUC 


210 


GACAAGAT C ACAGACAC 


1455 


1816 


GUGGAUCA CUGAUGAG X CGAA AGUGUCUG 


211 


CAGACACT T TGATCCAC 


1456 


1817 


GGUGGAUC CUGAUGAG X CGAA AAGUGUCU 


212 


AG ACACrr T GATCCACC 


1457 


1821 


AUCAGGUG CUGAUGAG X CGAA AUCAAAGU 


213 


AcrnriAT c cacctgat 


1458 
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1 1 ooi 1 AUUAUUACi CUGAUUAG X CGAA AGCUGGO 
im\ AUUAUGAG CUGAUGAO X CGAA AGGAGCU< 


C 214 


GCCCAGC T C CTCCTCA1 


^ 1 1459 


1 

Ih^ll ^AUAUGAUCUGAUGAGXCGAA AGGAGGAC 


3 215 


CAGCTCCr C CTCATCCT 


1460 


>oyv\ UUUUAOAGCUGAUGAUXCGAAAUGAGGAC 


} JlT 


CTCCTCCT'CATCCTCTC' 


1461 




f 217 


CTCCTCATCCTCTCCCA 


1462 


1893 AUGUGGGACUGAUGAGXCGAAAGGAUGAG 


218 


CICATCCTCTCCCACAT 


1463 


185^5 UGAUGUGG CUGAUGAG X CGAA AGAGGAUG 


219 


UAICCTCTCCCACATCA 


1464 


1902 AUGUUUCU CUGAUGAG X CGAA AUGUGGGA 


220 


TCCCACAT C AGGCACAT 


1465 


15^15 GCCUUUGU CUGAUGAG X CGAA ACUCAUGU 


221 


ACATG AGT A ACAAAGGC 


1466 


1933 GCUCiUACA CUGAUGAG X CGAA AUGCUCCA 


222 


IGGAGCAT C TGTACAGC" 


1467 


1937 UCAUULUG CUGAUGAG X CGAA ACAGAUGC 
1968 AGGUCAUACUGAUGAUXCGAAAGGGGCAC 


223 
224 


GCATCTGT A CAGCATGA 
GIGCCCCTC J ATGACCT 


1468 1 

1469 1 


19/u GCAGGUCA CUGAUGAG X CGAA AGAGGGGC 


225 


UCCCCTCT A rGACCTGC 


1470 


^UU7 GGCGCAUG CUGAUGAG X CGAA AGGCGGUG 
iOiO UCCACGGC CUGAUGAG X CGAA AGUGGGCO 


226 
227 


CACCGCCT A CATGCGCC 
CUCCCACT A GCCGTGGA"* 


1471 
1472 


2036 CCUCCACG CUGAUGAG X CGAA AUGCCCCU " 


228 


ACiGGGCAT C CGTGGAGG 


1473 


20W CAGUGGCC CUGAUGAG X CGAA AGUGGCUU 


229 


AAGCCACT 1 GGCCACTG 


1474 


Uo/« AUGAAUUA CUGAUGAG X CGAA AGCCCGCA 


230 


iCiCGGGCTCTACTTCAT 


1475 


2080 CGAUGAAG CUGAUGAG X CGAA AGAGCCCG 


231 


UGGGCTCTACTTCATCG 


1476 


2083 AUGCGAUG CUGAUGAG X CGAA AGUAGAGC 


232 


GCTCTACT T CATCGCAT 


1477 


2U84 AAUGCOAU CUGAUGAG X CGAA AAGUAGAG " 


233 


C rCTACTT C ATCGCATT 


1478 


2087 AGGAAUGC CUGAUGAG X CGAA AUGAAGUA " 


234 


I ACTTCAT C GCATTCCT 


1479 


20y2 UUGCAAUG CUGAUGAG X CGAA AUGCGAUG " 


235 


CAicciCAT T CCTTGCAA 


1480 


^uyj UUUGCAAG CUGAUGAG X CGAA AAUGCGAU " 


236 


A I CGCATT C C ITGCAAA 


1481 


2096 ACUUUUGC CUGAUGAG X CGAA AGGAAUGC " 


237 


UCAITCCTTGCAAAAGT 


1482 


2105 UGAUGUAA CUGAUGAG X CGAA ACUUUUGC " 


238 


GCAAAAGT A TTACATCA " 




2IU7 CGUGAUGU CUGAUGAG X CGAA AUACUUUU " 


239 


AAAAGTATTACATCACG " 


1484 


2108 CCGUGAUG CUGAUGAG X CGAA AAUACUUU " 


240 


AAAGTATi'ACATCACGG " 


1485 


^112 UCCCCCGU CUGAUGAG X CGAA AUGUAAUA " 


241 


1 A n ACAT C ACGGGGG A " " 


I486 


^131 GGCAGGGA CUGAUGAG X CGAA ACCCUCUG 


242 


CAGAGGGT 1^ TCCCTGCC 


1487 


2 132 UGGCAGUG CUGAUGAG X CGAA AACCCUCU 


243 


AUAGGGri'TCCCTGCCA " 


1488 


2133 UUUUCAGG CUGAUGACi X CGAA AAACCCUC 


244 


UAUUGTTTCCCTGCCAC " 


1489 
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2145 


AGCUCUCA CUGAUGAG X CGAA ACUGUGGC 


245 


GCCACAGT C TGAGAGCT 


1490 1 


2154 


GAGCCAGG CUGAUGAG X CGAA AGCUCUCA 


246 


TGAGAGCT C CCTGGCTC 


1491 1 


2162 


CCGUGUGG CUGAUGAG X CGAA AGCCAGGG 


247 


CCCTGGCT C CCACACGG 


1492 1 


2172 


AUUAUCUG CUGAUGAG X CGAA ACCGUGUG 


248 


CACACGGT T CAGATAAT 


1493 


2173 


GAUUAUCU CUGAUGAG X CGAA AACCGUGU 


249 


ACACGGTT C AGATAATC 


1494 


2178 


GCAGGGAU CUGAUGAG X CGAA AUCUGAAC 


250 


GTTCAGAT A ATCCCTGC 


1495 


2181 


GCAGCAGG CUGAUGAG X CGAA AUUAUCUG 


251 


CAGATAAT C CCTGCTGC 


1496 


2192 


GAGGGUAA CUGAUGAG X CGAA AUGCAGCA 


252 


TGCTGCAT T TTACCCTC 


1497 


2193 


UGAGGGUA CUGAUGAG X CGAA AAUGCAGC 


253 


GCTGCATT T TACCCTCA 


1498 


2194 


AUGAGGGU CUGAUGAG X CGAA AAAUGCAG 


254 


CTGCATTT T ACCCTCAT 


1499 


2195 


GAUGAGGG CUGAUGAG X CGAA AAAAUGCA 


255 


lUCAn ri ACCCTCATC 


1500 


2200 


UGCAUGAU CUGAUGAG X CGAA AGGGUAAA 


256 


TTTACCCT C ATCATGCA 


1501 


2203 


UGGUGCAU CUGAUGAG X CGAA AUGAGGGU 


257 


ACCCTCAT C ATGCACCA 


1502 


2214 


UUUGGCUA CUGAUGAG X CGAA AGUGGUGC 


258 


GCACCACT T TAGCCAAA 


1503 


2215 


AUUUGGCU CUGAUGAG X CGAA AAGUGGUG 


259 


CACC ACTT T AGCCAAAT 


1504 


2216 


AAUUUGGC CUGAUGAG X CGAA AAAGUGGU 


260 


ACCACTTT A GCCAAATT 


1505 


2224 


GG AGACAG CUGAUGAG X CGAA AUUUGGCU 


261 


AGCCAAAT T CTGTCTCC 


1506 


2225 


AGGAGACA CUGAUGAG X CGAA AAUUUGGC 


262 


GCCAAATI^ C TGTCTCCT 


1507 


2229 


AUGCAGGA CUGAUGAG X CGAA ACAGAAUU 


263 


AATTCTGT C TCCTGCAT 


1508 


2231 


GUAUGCAG CUGAUGAG X CGAA AGACAGAA 


264 


TTCTGTCT C CTGCATAC 


1509 


2238 


CCGGAGUG CUGAUGAG X CGAA AUGCAGGA 


265 


TCCTGCAT A CACTCCGG 


1510 


2243 


GCAUGCCG CUGAUGAG X CGAA AGUGUAUG 


266' 


CATACACT C CGGCATGC 


1511 


2254 


UGGUGUUG CUGAUGAG X CGAA AUGCAUGC 


267: 


GCATGCAT C CAACACCA 


1512 


2269 


CAUCUAGA CUGAUGAG X CGAA AGCCAUUG 


268- 


CAATGGCT T TCTAGATG 


1513 


2270 


UCAUCUAG CUGAUGAG X CGAA AAGCCAUU 


269' 


AATGGCTT T CTAG A TG A 


1514 


9071 


*^UUAUL,UA CUuAUuAu A COAA AAAGCCAU 


270 . 


ATGGCTTT C TAGATGAG 


1515 


2273 


CACUCAUC CUGAUGAG X CGAA AGAAAGCC 


271-. 


GGCTTTCT A GATGAGTG 


1516 


2287 


AGCAAAUG CUGAUGAG X CGAA AUGGCCAC 


272". 


G I GGCCAT T CATTTGCT 


1517 


2288 


AAGCAAAU CUGAUGAG X CGAA AAUGGCCA 


273' 


TGGCCATT C ATTTGCTT 


1518 


2291 


AGCAAGCA CUGAUGAG X CGAA AUGAAUGG 


274 


CCAn CAT T TGCTTGCT 


1519 


2292 


GAGCAAGC CUGAUGAG X CGAA AAUGAAUG 


275 


CATTCATT T GCTTGCTC 


1520 
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229 


6 AACUGAGC CUGAUGAG X CGAA AGCAAAUC 


J 1 276 


j (^A l i ruCTTGCTCAGtf 


1 1521 


230 


0 UAAGAACU CUGAUGAG X CGAA AGCAAGCA 


277 


TGCn'GC]' C AGTTCTTA 


1522 


230 


4 CCACUAAG CUGAUGAG X CGAA ACUGAGCA 


278 


TGCTCAGT T CTTAGTGG ■ 


1523 


230. 
230' 


> UCCACUAA CUGAUGAG X CGAA AACUGAGC 


279 


GCTCAGrr C TTAGTGGC" 


1524 




r GUGCCACU CUGAUGAG X CGAA AGAACt/GA 


280 


TCAGTTCT T AGTGGCAC ■ 


1525 


230J 
2318 


; UGUGCCAC CUGAUGAG X CGAA AAGAACUG" 


28! 


CAGTTCn A GTGGCACA 


1526 




AGACAGAA CUGAUGAG X CGAA AUGUGCCA 


282 


TGGCACAl CTTCTGTCT 


1527 


2320 
2321 


GAAGACAG CUGAUGAG X CGAA AGAUGUGC 


283 


GCACATCl' T CTGTCTTC 


1528 




AGAAGACA CUGAUGAG X CGAA AAGAUGUG" 


284 


CACATCri' C TGTCTTCT 


1529 


2325 
2327 


CAACAGAA CUGAUGAG X CGAA ACAGAAGA 


285 


IdTCTGT C TTCTGTTG 


1530 




CCCAACAG CUGAUGAG X CGAA AGACAGAA 


286 


TTCTGTCT T CTGTTGGG 


1531 


2328 


UCCCAACA CUGAUGAG X CGAA AAG AC AG A 


287 


I CTGTCTT C TGTTGGGA 


1532 


2332 


CUGUUCCC CUGAUGAG X CGAA ACAGAAGA 


288 


rciTCTGTTGGGAACAG 


1533 


2351 


AGCCUUGG CUGAUGAG X CGAA AUCCCUUU " 


289 


AAAGGGAT T CCAAGGCT 


1534 


2352 
2360 


UAGCCUUG CUGAUGAG X CGAA AAUCCCUU 


290 


AAGGGATT C CAAGGCTA" 


1535 




CAAAGAUU CUGAUGAG X CGAA AGCCUUGG 


291 


CCAAGGCT A AATCTTTG ' 


1536 


2364 


GUUACAAA CUGAUGAG X CGAA AUUUAGCC 


292 


GGCTAAAT C TTTGTAAC 


1537 


2366 


CUGUUACA CUGAUGAG X CGAA AGAUUUAG 


293 


CTAAATCT 1 TGIAACAG 


1538 


2367 


GCUGUUAC CUGAUGAG X CGAA AAGAUUUA 


294 


1 AAATCTT T GTAACAGC 


1539 


2370 


AGAGCUGU CUGAUGAG X CGAA ACAAAGAU " 


295 


AlCiTIGTAACAGCTCT " 


1540 


2377 


GGGAAAGA CUGAUGAG X CGAA AGCUGUUA 


296 


I AACAGCT C TCTTTCCC 


1541 


2379 


GGGGGAAA CUGAUGAG X CGAA AGAGCUGU ' " 


297 


ACAGCTCT c rrrccccc 


1542 




AAGGGGGA CUGAUGAG X CGAA AGAGAGCU 


298 


AGCTCTCT T TCCCCCTT " " 


1543 


2382 


CAAGGGGG CUGAUGAG X CGAA AAGAGAGC" " 


299 


GC lCl Cn TCCCCCTTG 


1544 


2383 


UCAAGGGG CUGAUGAG X CGAA AAAG AGAG 


300 


CTCTCTTTCCCCCTTGC " 


1545 


2389 


AACAUAGC CUGAUGAG X CGAA AGGGGGAA " 


301 


TTCCCCCTTGCTATGTT 


1546 


2393 
2397 


UAGUAACA CUGAUGAG X CGAA AGCAAGGG " 


302 


CCCTTGCT A TGTTACTA 


1547 




CUCUUAGU CUGAUGAG X CGAA ACAUAGCA " 


303 


TGCTATGT T ACTAAGCG " 


1548 


2398 


ACGCUUAG CUGAUGAG X CGAA AACAUAGC " 


304 


GCTATGTT A CTAAGCGT " " 


1549 


2401 


CUCACGCU CUGAUGAG X CGAA AGUAACAU" " 


305 


AIGITACTAAGCGTGAG 


1550 


2413 


GCUACGGG CUGAUGAG X CGAA AUCCUCAC" 


306 


GIGAGGATTCCCGTAGC 


1551 
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2414 


AGCUACGG CUGAUGAG X CGAA AAUCCUCA 


307 


TGAGGATT C CCGTAGCT 


1552 


2419 


UGAAGAGC CUGAUGAG X CGAA ACGGGAAU 


308 


ATTCCCGT A GCTCTTCA 


1553 


2423 


GCUGUGAA CUGAUGAG X CGAA AGCUACGG 


309 


CCGTAGCT C TTCACAGC 


1554 


2425 


CAGCUGUG CUGAUGAG X CGAA AGAGCUAC 


310 


GTAGCTCT T CACAGCTG 


1555 


2426 


UCAGCUGU CUGAUGAG X CGAA AAGAGCUA 


311 • 


TAGCTCTT C ACAGCTG A 


1556 


2438 


CAUAGACU CUGAUGAG X CGAA AGUUCAGC 


312 


GCTGAACT C AGTCl Al G 


1557 


2442 


AACCCAUA CUGAUGAG X CGAA ACUGAGUU 


313 


AACTCAGT C TATGGGTT 


1558 


2444 


CCAACCCA CUGAUGAG X CGAA AGACUGAG 


314 


CTCAGTCT A TGGGTTGG 


1559 


2450 


UGAGCCCC CUGAUGAG X CGAA ACCCAUAG 


315 


CTATGGGT T GGGGCTCA 


1560 


2457 


AGUUAUCU CUGAUGAG X CGAA AGCCCCAA 


316 


TTGGGGCT C AGATAACT 


1561 


2462 


CACAGAGU CUGAUGAG X CGAA AUCUGAGC 


317 


GCTCAGAT A ACTCTGTG 


1562 


2466 


AAUGCACA CUGAUGAG X CGAA AGUUAUCU . 


318 


AGATAACT C TGTGCATT 


1563 


2474 


GUAGCUUA CUGAUGAG X CGAA AUGCACAG 


319 


CTGTGCAT T TAAGCTAC 


1564 


2475 


AGUAGCUU CUGAUGAG X CGAA AAUGCACA 


320 


TGTGCATT T AAGCTACT 


1565 


2476 


AAGUAGCU CUGAUGAG X CGAA AAAUGCAC 


321 


GTGCATTT A AGCTACTT 


1566 


2481 


UCUACAAG CUGAUGAG X CGAA AGCUUAAA 


322 


TTTAAGCT A CTTGTAGA 


1567 


2484 


GUCUCUAC CUGAUGAG X CGAA AGUAGCUU 


323 


AAGCTACT T GTAGAGAC 


1568 


2487 


UGGGUCUC CUGAUGAG X CGAA ACAAGUAG 


324 


CTACTTGT A GAGACCCA 


1569 


2508 


AAAAUGUC CUGAUGAG X CGAA ACUCUCCA 


325 


TGGAGAGT A GACATTTT 


1570 


2514 


AGAGGCAA CUGAUGAG X CGAA AUGUCUAC 


326 


GTAGACAT T TTGCCTCT 


1571 


2515 


CAGAGGCA CUGAUGAG X CGAA AAUGUCUA 


327 


TAG ACATT T TGCCTCTG 


1572 


2516 


UCAGAGGC CUGAUGAG X CGAA AAAUGUCU 


328: 


AGACATTT T GCCTCTGA 


1573 


2521 


GCUUAUCA CUGAUGAG X CGAA AGGCAAAA 


329- 


TTTTGCCT C TGATAAGC 


1574 


2526 


AAAGUGCU CUGAUGAG X CGAA AUCAGAGG 


330- 


CCTCTGAT A AGCACTTT 


1575 


2533 


CAUUUAAA CUGAUGAG X CGAA AGUGCUUA 


331 


TAAGCACT T TTTAAATG 


1576 


2534 


CCAUUUAA CUGAUGAG X CGAA AAGUGCUU 


332- 


AAGCACTT T TTAAATGG 


1577 


2535 


GCCAUUUA CUGAUGAG X CGAA AAAGUGCU 


333 • 


AGCACTTT T TAAATGGC 


1578 


2536 


AGCCAUUU CUGAUGAG X CGAA AAAAGUGC 


334 


GCACTTTT T AAATGGCT 


1579 


2537 


GAGCCAUU CUGAUGAG X CGAA AAAAAGUG 


335 


CACTTTn A AA TGGCTC 


1580 


2545 


UAUUCUUA CUGAUGAG X CGAA AGCCAUUU 


336 


AAATGGCT C TAAGAATA 


1581 


2547 


CUUAUUCU CUGAUGAG X CGAA AGAGCCAU 


337 


ATGGCTCT A AGAATAAG 


1582 
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255 


3 CUGUGGCUCUGAUGAGXCGAAAUUCUUAG 


338 


Cl AAGAAT A AGCCACAG 


1583 


257 


0 CCACUUUA CUGAUGAG X CGAA AUUCUUaC 


339 


CAAAG AAT T TA AAGTGG 


1584 


257 


1 UCCACUUU CUGAUGAG X CGAA AAUUCUyU 


340 


AAAGAATT T AAAGTGGC 


1585 


257: 


I AGCCACUU CUGAUGAG X CGAA AAAUUCUU 


341 


AAGAATT I A AAGTGGCT 


1586 


258 


AAUUAAAG CUGAUGAG X CGAA AGCCACUU 


342 


AAGTGGCT C CTTTAA TI 


1587 


2584 


\ ACCAAUUA CUGAUGAG X CGAA AGGAGCCA 


343 


TGGCTCCT i l AATTGGT 


1588 


2585 


CACCAAUU CUGAUGAG X CGAA AAGGAGCC" 


344 


GGCTCCTT T AATTGGTG 


1589 


2586 


UCACCAAU CUGAUGAG X CGAA AAAGGAGC 


345 


GCTCCTTT A ATTGGTGA 


1590 


2589 


AAGUCACC CUGAUGAG X CGAA AUUAAAGG 


346 


CCJH AAT T GGTGACTT 


1591 


2597 


CUUUCUCC CUGAUGAG X CGAA AGUCACCA 


347 


TGGTGACT T GGAGAAAG 


1592 


2608 


CCUUGACC CUGAUGAG X CGAA AGCUUUCU 


348 


AGAAAGCl A GGTCAAGG 


1593 


2612 


AAACCCUU CUGAUGAG X CGAA ACCUAGCU" 


349 


AGCI AGGT C AAGGGTTT 


1594 


2619 


CUAUAAUA CUGAUGAG X CGAA ACCCUUGA" 


350 


I CAAGGGT T TATTATAG 


1595 


2620 


GCUAUAAU CUGAUGAG X CGAA AACCCUUG 


351 


CAAGGGTT T ATTATAGC 


1596 


2621 


UGCUAUAA CUGAUGAG X CGAA AAACCCUU 


352 


AAGGGTTT A TTATAGCA 


1597 


2623 


GGUGCUAU CUGAUGAG X CGAA AUAAACCC 


353 


GGGTTTAT T A TAGCACC 


1598 


2624 


GGGUGCUA CUGAUGAG X CGAA AAUAAACC 


354 


GGl 1 1 Al l A TAGCACCC 


1599 


2626 


GAGGGUGC CUGAUGAG X CGAA AUAAUAAA 


355 


riTATTAT A GCACCCTC 


1600 


2634 


GAAUACAA CUGAUGAG X CGAA AGGGUGCU 


356 


AGCACCCT C TTGTATTC 


1601 


2636 


AGGAAUAC CUGAUGAG X CGAA AGAGGGUG " 


357 


CACCCTCT T GTATTCCt " 


1602 


2639 


CAUAGGAA CUGAUGAG X CGAA ACAAGAGG 


358 


CCTCTTGTATTCCTATG " 


1603 


2641 


GCCAUAGG CUGAUGAG X CGAA AUACAAGA" " 


359 


rCTTGTAT T CCTA'rGGC " " 


1604 


2642 


UGCCAUAG CUGAUGAG X CGAA AAUACAAG 


360 


crrGTATT C CTATGGCA 


1605 


2645 


CAUUGCCA CUGAUGAG X CGAA AGGAAUAC " " 


361 


GTATTCCT A TGGCAATG 


1606 


2657 


CAUAAAAG CUGAUGAG X CGAA AUGCAUUG 


362 


CAATGCAT C CTTTTATG 


1607 


zoou 


UUUCAUAA CUUAUUAU X CGAA AGGAUGCA 


363 


TGCATCCT T TTATGAAA" " 


1608 


2661 


CUUUCAUA CUGAUGAG X CGAA AAGGAUGC 


364 


GCATCCTT T TATGAAAG 


1609 


2662 


ACUUUCAU CUGAUGAG X CGAA AAAGGAUG 


365 


CATCCirr 1 ATGAAAGT 


1610 


2663 


CACUUUCA CUGAUGAG X CGAA AAAAGGAU 


366 


AICCTTTTA IGAAAGTG 


1611 


2674 


UUAAGGUG CUGAUGAG X CGAA ACCACUUU 


367 


AAAGTGGT A CACCTTAA 


1612 


2680 


AAAGCUUU CUGAUGAG X CGAA AGGUGUAC " 


368 


GTAGACCT T AA AGCTTT 


1613 
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2681 


AAAAGCUU CUGAUGAG X CGAA AAGGUGUA 


369 


TACACCTT A AAGCTTTT 


1614 


2687 


UCAUAUAA CUGAUGAG X CGAA AGCUUUAA 


37J0' 


TTAAAGCT T TTATATGA 


1615 


2688 


GUCAUAUA CUGAUGAG X CGAA AAGCUUUA 


37f 


TAAAGCTT T TATATGAC 


1616 


2689 


AGUCAUAU CUGAUGAG X CGAA AAAGCUUU 


372 


AAAGCirr r atatgact 


1617 


2690 


CAGUCAUA CUGAUGAG X CGAA AAAAGCUU 


373 


AAGCTTTT A TATGACTG 


1618 


2692 


UACAGUCA CUGAUGAG X CGAA AUAAAAGC 


374 


GC r]"l"l AT A TGACTGTA 


1619 


2700 


UACUCUGC CUGAUGAG X CGAA ACAGUCAU 


375; 


ATGACTGT A GCAGAGTA 


1620 


2708 


UCACCAGA CUGAUGAG X CGAA ACUCUGCU 


376 


AGCAGAGT A TCTGGTGA 


1621 


2710 


AAUCACCA CUGAUGAG X CGAA AUACUCUG 


377 


CAGAGTAT C TGGTGATT 


1622 


2718 


GAAUUGAC CUGAUGAG X CGAA AUCACCAG 


378 


CTGGTGAT T GTCAATTC 


1623 


2721 


AGUGAAUU CUGAUGAG X CGAA ACAAUCAC 


379 


GTGATTGT C AATTCACT 


1624 


2725 


GGGAAGUG CUGAUGAG X CGAA AUUGACAA 


380 


TTGTCAAT T CACTTCCC 


1625 


2726 


GGGGAAGU CUGAUGAG X CGAA AAUUGACA 


381 


TGTCAATT C ACTTCCCC 


1626 


2730 


AUAGGGGG CUGAUGAG X CGAA AGUGAAUU 


382 


AATTCACT T CCCCCTAT 


1627 


2731 


UAUAGGGG CUGAUGAG X CGAA AAGUGAAU 


383 


ATTCACTT C CCCCTATA 


1628 


2737 


UAUUCCUA CUGAUGAG X CGAA AGGGGGAA 


384 


TTCCCCCT A TAGGAATA 


1629 


2739 


UGUAUUCC CUGAUGAG X CGAA AUAGGGGG 


385 


CCCCCTAT A GGAATAC A 


1630 


2745 


GCCCCUUG CUGAUGAG X CGAA AUUCCU AU 


386 


ATAGGAAT A CAAGGGGC 


1631 


2772 


AACUAGGG CUGAUGAG X CGAA AUCUGCCU 


387 


AGGCAGAT C CCCTAGTT 


1632 


2777 


UGGCCAAC CUGAUGAG X CGAA AGGGGAUC 


388 


GATCCCCT A GTTGGCCA 


1633 


2780 


UCUUGGCC CUGAUGAG X CGAA ACUAGGGG 


389 


CCCCTAGT T GGCCAAGA 


1634 


2791 


GUUAAAAU CUGAUGAG X CGAA AGUCUUGG 


390' 


CCAAGACT T ATTTTAAC 


1635 


2792 


AGUUAAAA CUGAUGAG X CGAA AAGUCUUG 


391 


CAAGACTT A mTAACT 


1636 


2794 


CAAGUUAA CUGAUGAG X CGAA AUAAGUCU 


392 


AGACTTAT T TTAACTTG 


1637 


2795 


UCAAGUUA CUGAUGAG X CGAA AAUAAGUC 


39>: 


G ACTTATT T TAACTTGA 


1638 


Z/VO 


AUCAAGUU CUGAUGAG X CGAA AAAUAAGU 


394- 


ACTTATTT T AACTTG AT 


1639 


2797 


UAUCAAGU CUGAUGAG X CGAA AAAAUAAG 


395- 


CTTATTTT A ACTTGATA 


1640 


2801 


AGUGUAUC CUGAUGAG X CGAA AGUUAAAA 


396; 


TTTTAAC r T GATACACT 


1641 


2805 


CUGCAGUG CUGAUGAG X CGAA AUCAAGUU 


397 


AAC ITGA T A CACTGCAG 


1642 


2816 


ACACUCUG CUGAUGAG X CGAA AUCUGCAG 


398 


CTGCAGAT T CAGAGTGT 


1643 


2817 


GACACUCU CUGAUGAG X CGAA AAUCUGCA 


399 


TGCAGATT C AGAGTGTC 


1644 
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lis: 

283 


15 AGCUUCAGCUGAUGAGXCGAAACAC^UC 


rr'400 


CAGAGTGT C CTGAAGCl 


1645 


284 


4 CAGAGGCACUGAUGAGXCGAAAGCUUCAC 


r 401 


CTGAAGCr C TGCCTCTG 


1646 


284 


0 UAAAGCCACUGAUGAGXCGAAAGGCAGAC 


r 402 


CTCTGCCTCTGGCTTTC 


1647 


284 


6 UG ACCGG A CUG AUGAG X CGAA AGGCAGAG 


403 


CTCTGGCT T TCCGGTCA 


1648 


284; 


7 AUGACCGGCUGAUGAGXCGAAAAGCCAGA 


404 


TCTGGCn l CCGGTCAT 


1649 




S t;AUGACCGCUGAUGAGXCGAAAAAGCCAG 


405 


CrGGCTTT C CGGTCATG ■ 


1650 


285: 
286C 


J UAACCCAUCUGAUGAGXCGAAACCGGAAA 


406 


ITTCCGGTCATGGGTTC 


1651 


2861 


) UUAACUGG CUGAUGAG X CGAA ACCCAUGA 


407 


TCATGGGT' T CCAGTTAA ' 


1652 


2866 


AUUAACUG CUGAUGAG X CGAA AACCCAUG 


408 


C ATGGGTT C CAGTTAAT 


1653 


2867 


CAUGAAUU CUGAUGAG X CGAA ACUGGAAC" 


409 


GTTCCAGT T AA1TCATG 


1654 


287! 


GCAUGAAU CUGAUGAG X CGAA AACUGGAA 


410 


MCCAGTTAATTCATGC " 


1655 


GAGGCAUG CUGAUGAG X CGAA AUUAACUG"' 


411 


CAGTTAAT 1* CATGCCTC" 


1656 


2878 


UGAGGCAU CUGAUGAG X CGAA AAUUAACU 


412 


AGTTAATT C ATGCCTCC 


1657 




UUCCAUGG CUGAUGAG X CGAA AGGCAUGA 


413 


TCATGCCT C CCATGGAC"" 


1658 


2889 


GCUCUCCA CUGAUGAG X CGAA AGGUCCAU 


414 


ATGGACCT A TGGAGAGC 


1659 


2905 
2909 


CUAAGAUC CUGAUGAG X CGAA ACUUGUUG 


415 


CAACAAGT T GATCTTAG " 


1660 


291 1 


UUAACUAA CUGAUGAG X CGAA AUCAACUU" 


416 


AAGTTGAT C TTAGTTAA 


1661 


2912 


ACUUAACU CUGAUGAG X CGAA AGAUCAAC " 


417 


G ITGATCT T AGTTAAGT 


1662 


2915 


GACUUAAC CUGAUGAG X CGAA AAGAUCAA 


418 


ITGATCTTAGTTAAGTC 


1663 


2916 


UGAGACUU CUGAUGAG X CGAA ACUAAGAU 


419 


ATCTTAGTTAAGTCTCC " 


1664 


2920 


GGGAGACU CUGAUGAG X CGAA AACUAAGA 


420 


TCTTAGTT A AGTCTCCC " " 


1665 


2922 


UAUAGGGA CUGAUGAG X CGAA ACUUAACU T 


421 


AGll AAGTC rCCCTATA " 


1666 




CAUAUAGG CUGAUGAG X CGAA AGACUUAA ' f 


422 


ITAAGTCTCCCTATATG 


1667 


2926 


CCCUCAUA CUGAUGAG X CGAA AGGGAGAC f 


423 


GICTCCCTATATGAGGG " 


1668 


2928 
2937 


AUCCCUCA CUGAUGAG X CGAA AUAGGGAG 


424 


CTCCCTAT A TGAGGGAT " ' 


1669 


2941 


L-uuAUuAu A COAA AUCCCUCA | 


425 


TGAGGG AT A AGTTCCTG""" 


1670 


2942 


AAAUCAGG CUGAUGAG X CGAA ACUUAUCC 


426 


GGATAAGTTCCTGATTl"" ' 


1671 


2948 " 


AAAAUCAG CUGAUGAG X CGAA AACUUAUC T 


427 


GATAAG1T C CTGATTTT " 


1672 




AAAACAAA CUGAUGAG X CGAA AUCAGGAA " 


428 


rrCCTGAT T TTTGTTTT " 


1673 


2949 : 


AAAAACAA CUGAUGAG X CGAA AAUCAGGA 


429 


TCCTGATT T TTGTTTTT 


1674 


2950 \ 


JAAAAACA CUGAUGAG X CGAA AAAUCAGG ' 


430 


CCTGATrr T TGTTTTTA 


1675 
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|295l 


AUAAAAAC CUGAUGAG X CGAA AAAAUCAG 


431 


CTGATITT T GTTTTTAT 


1676 


2954 


AAAAUAAA CUGAUGAG X CGAA ACAAAAAU 


432 


ATTTTTGT T TTTATTTT 


1677 


2955 


AAAAAUAA CUGAUGAG X CGAA AACAAAAA 


433 


TTTTTGTT T TTATTTTT 


1678 


2956 


CAAAAAUA CUGAUGAG X CGAA AAACAAAA 


434 


TTTTGTTT T TATTTTTG 


1679 1 


2957 


ACAAAAAU CUGAUGAG X CGAA AAAACAAA 


435 


TTTGnTr T A rrirrGT 


1680 


2958 


CACAAAAA CUGAUGAG X CGAA AAAAACAA 


436 


TTGmrr a tttttgtg 


1681 


2960 


AACACAAA CUGAUGAG X CGAA AUAAAAAC 


437 


GTTTTTAT T TTTGTGTT 


1682 


296 J 


UAACACAA CUGAUGAG X CGAA AAUAAAAA 


438 


TTm ATT T TTGTGTTA 


1683 


2962 


GUAACACA CUGAUGAG X CGAA AAAUAAAA 


439 


TTTTATTT T TGTGTTAC 


1684 


2963 


UGUAACAC CUGAUGAG X CGAA AAAAUAAA 


440 


TTTATTTT T GTGTTACA 


1685 


2968 


UCUUUUGU CUGAUGAG X CGAA ACACAAAA 


441 


n ri <j rti T T acaaaaga 


1686 


2969 


UUCUUUUG CUGAUGAG X CGAA AACACAAA 


442 


riTGTGTT A CAAAAGAA 


1687 


2984 


CAGGGAGG CUGAUGAG X CGAA AGGGCUUU 


443 


AAAGCCCT C CCTCCCTG 


1688 


2988 


AGUUCAGG CUGAUGAG X CGAA AGGGAGGG 


444 


CCCTCCCT C CCTGAACT 


1689 


2997 


CUUACUGC CUGAUGAG X CGAA AGUUCAGG 


445 


CCTGAACT T GCAGTAAG 


1690 


3003 


GCUGACCU CUGAUGAG X CGAA ACUGCAAG 


446 


cttgcagt a AGGTCAGC 


1691 


3008 


CUGAAGCU CUGAUGAG X CGAA ACCUUACU 


447 


AGTAAGGT C AGCTTCAG 


1692 


3013 


AGGUCCUG CUGAUGAG X CGAA AGCUGACC 


448 


GGTCAGCT T CAGG ACCl^ 


1693 


3014 


CAGGUCCU CUGAUGAG X CGAA AAGCUGAC 


449 


GTCAGCTT C AGGACCTG 


1694 


3024 


CCCACUGG CUGAUGAG X CGAA ACAGGUCC 


450 


GGACCTGT T CCAGTGGG 


1695 


3025 


GCCCACUG CUGAUGAG X CGAA AACAGGUC 


451 


GACCTGTT C CAGTGGGC 


1696 


3039 


GAUCCAAG CUGAUGAG X CGAA ACAGUGCC 


452 . 


GGCACTGT A CTTGGATC 


1697 


3042 


GAAGAUCC CUGAUGAG X CGAA AGUACAGU 


453 


ACTGTACT T GGATCTTC 


1698 


3047 


GCCGGGAA CUGAUGAG X CGAA AUCCAAGU 


454 


ACTTGGAT C TTCCCGGC 


1699 


3049 


ACGCCGGG CUGAUGAG X CGAA AGAUCCAA 


455: 


TTGGATCT T CCCGGCGT 


1700 


3050 


CACGCCGG CUGAUGAG X CGAA AAGAUCCA 


456. 


TGGATCTT C CCGGCGTG 


1701 


3068 


CCCUGUGU CUGAUGAG X CGAA AGGCACAC 


457. 


GTGTGCCT T ACACAGGG 


1702 


3069 


CCCCUGUG CUGAUGAG X CGAA AAGGCACA 


458 


TGTGCCrr A CACAGGGG 


1703 


3086 


CCACAGUG CUGAUGAG X CGAA ACAGUUCA 


459 


TGAACTGT T CACTGTGG 


1704 


3087 


ACCACAGU CUGAUGAG X CGAA AACAGUUC 


460 


GAACTGTT C ACTGTGGT 


1705 


3112 


CUACCAUU CUGAUGAG X CGAA ACCCUCAU 


461 


ATGAGGGT A AATGGTAG 


1706 
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31J9 
3l22 



CUUUCAAC CUGAUOAG X COAA ACCAUUUA 



13146 
3I5T 



3152 



3153 



UUUCUUUC CUGAUOAU X CUAA ACUACCA 
CUAAAUGC CUGAUOAG X CGAA ACACCAGG 
UAtiCiGCUA CUGAUGAG X CGAA AUGCAACA 



CCAGGGCU CUGAUOAG X CGAA AAUGCAAC 



CCCAUGGC CUGAUGAG X CGAA AAAUGCAA 



462 



463 



465 
"466" 



467 



TAAArGGTAGTTGAAAG 



ATGGiACil T GAAAGGAG 
UUJUUTGTTGCArrTAG 



TGTTGCATTTAGCCCTG 



gjiucatttagccctgg 

ilOUWll AGCCCTGGG 



1707 



1708 
1709" 



1710 



I7II 



1712 



468 



3182 



UUCUUCAC CUGAUGAG X CGAA AGUACUGU 



3192 
3T95 
3202 
3205 



GCCACAAC CUGAUGAG X CGAA AUCCUGCA 
UUAGCCAC CUGAUGAG X CGAA ACAAUCCU 
UUUUCUAG CU GAUGAG X CGAA AGCCACAA 
UUUUUCUC CUGAUGAG X CGAA AGUAGCCA 



469 
470 
471 
472 
473 



TGAACAGT A CTTGTGCA 



I7I3 



OuuuCiCCC CU GAUGAG X CGAA ACUUUCCC 



474 



3240 



475 



AC/^OIACMGTGCAGOA 
1 ULAUGAT T GTTGTGGC 
AUUA ITGT T GTGGCTAT 
IIGIGGCTACTAGAGAA 
ACl AGAGAACAA 



UUGAAAGT A GGGCAGAA" 



AACI UGA 1 A CAGTTCTG 



1714 j 
1715 

TtTT 
TnT 



1719 



1720 



476 



GAl ACAGl r CTGAGCAC" 



3246 



UUUUCUCA CUGAUGAG X CGAA AACUGUAU 



477 



ATACAGl 1 C TGAGCACA 



3263 



ACCUGAGC CUGAUGAG X CGAA AGUCUGGC 



478 



GCCAGACI TGCTCAGGT 



3267 



3293 



3297 



3305 



3306 
3309 
3323 



3328 
3329 
3346 



GUUCACCU CU GAUGAG X CGAA AGCAAGUC 



479 



GALUUCl CAGGTGGCC 



UUUCUAGG CUGAUGAG X CGAA AGCUGCAG 



480 



Cl UCAGCT A CCTAGGAA 



AAUGUuccJ UUUAUGAG X CGAA AGGUAGCU 



481 



CUGCaauu CUGAUGAG X CGAA AUGUUCCU 



UCUGCAAG CUGAUGAG X CGAA AAUGUUCC 



GGGUCUGC CUGAUGAG X CGAA AGGAAUGU 
GCAAAGGC CUGAUGAG X CGAA AUGCG 



3358 



3366 



CACCCCCA CUGAUGAG X CGAA AGGCAAUC," 
UCACCCCC CUGAUGAG X CGAA AAGGCAAU 
ACCCCAGG CUGAUGAG X CGAA AUCCCAGG" 



AGCUUGAC CUGAUGAG X CGAA ACCCCAGG 



AAUAGCUG CUGAUGAG X CGAA ACUACCCC 



AUGAAUAA CUGAUGAG X CGAA AGCUGGAC 
AAAUGAAU CUGAUGAG X CGAA AGAGCUGG 



AGCI ACCT A GGAACATT" 



482 



AGGAACA J- T CCTTGCAG 



483 
"487" 



GGAACATTCCffGCAGA 
ACAi 1 ccr T GCAGACCC 
CCCCGCAT T GCCTTTGG 
CAli GCCT T TGGGGGTG 
AITGCCTT T GGGGGTGC 
CCTGGGAT C CCTGGGGT 



489 



CClGGGGTAOrcCAGCT 



490 



491 



492 



Gggg l AGT C CAGCTClT 



GICCAGCTCTTATTCAT" 



CCAGCJCTTATTCATTf 



I72I 



1722 



1723 



1724 



1725 



1726 



1727 



1728 

TnT 

TnT 



1734 



1735 
T736" 



1737 
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3367 


GAAAUGAA CUGAUGAG X CGAA AAGAGCUG 


493 


CAGCTCTT A TTCATTTC 


1738 


3369 


GGGAAAUG CUGAUGAG X CGAA AUAAGAGC 


494^: 


GCTCri'AT T CATTTCCC 


1739 


3370 


UGGGAAAU CUGAUGAG X CGAA AAUAAGAG 


495:- 


CTCTTATT C ATTTCCCA 


1740 


3373 


CGCUGGGA CU6AUGAG X CGAA AUGAAUAA 


496; 


TTAn CAT T TCCCAGCG 


1741 


3374 


ACGCUGGG CUGAUGAG X CGAA AAUGAAUA 


497' 


TATTCATT T CCCAGCGT 


1742 


3375 


CACGCUGG CUGAUGAG X CGAA AAAUGAAU 


498 


ATTCATTT C CCAGCGTG 


1743 


3392 


CUUCUUCC CUGAUGAG X CGAA ACCAGGGC 


499 


GCCCTGGT T GGAAGAAG 


1744 


3408 


UACAACUU CUGAUGAG X CGAA ACAGCUGC 


500 


GCAGCTGT C AAGTTGTA 


1745 


3413 


CUGUCUAC CUGAUGAG X CGAA ACUUGACA 


501 


TGTCAAGT T GTAGACAG 


1746 


3416 


CAGCUGUC CUGAUGAG X CGAA ACAACUUG 


502 


CAAGITGT A GACAGCTG 


1747 


3428 


AUUGUAGG CUGAUGAG X CGAA ACACAGCU 


503 


AGCTGTGT T CCTACAAT 


1748 


3429 


AAUUGUAG CUGAUGAG X CGAA AACACAGC 


504 


GCTGTGTT C CTACAATT 


1749 


3432 


GCCAAUUG CUGAUGAG X CGAA AGGAACAC 


505 


GTGTl CCT A CAATTGGC 


1750 


3437 


GCUGGGCC CUGAUGAG X CGAA AUUGUAGG 


506 


CCTACAAT T GGCCCAGC 


1751 


3478 


GUQACAGC CUGAUGAG X CGAA ACGGUCCC 


507 


GGG ACCGT T GCTGTCAC 


1752 


3484 


UGAGUAGU CUGAUGAG X CGAA ACAGCAAC 


508 


GTTGCTGT C ACTACTCA 


1753 


3488 


AGCCUGAG CUGAUGAG X CGAA AGUGACAG 


509 


CTGTC ACT A CTCAGGCT 


1754 


3491 


GUCAGCCU CUGAUGAG X CGAA AGUAGUGA 


510 


TCACTACT C AGGCTGAC 


1755 


3511 


CGUAAUCU CUGAUGAG X CGAA ACCAGGCC 


511 


GGCCTGGT C AGAITACG 


1756 


3516 


GCAUACGU CUGAUGAG X CGAA AUCUGACC 


512 


GGTCAGAT T ACGTATGC 


1757 


3517 


GGCAUACG CUGAUGAG X CGAA AAUCUGAC 


513 


GTCAG ATT A CGTATGCC 


1758 


3521 


CAAGGGCA CUGAUGAG X CGAA ACGUAAUC 


514 


GATTACGT A TGCCCTTG 


1759 


3528 


AAACCACC CUGAUGAG X CGAA AGGGCAUA 


515 


TATGCCCT T GGTGGTTT 


1760 


3535 


UAUCUCUA CUGAUGAG X CGAA ACCACCAA 


516 


TTGGTGGT T TAGAGATA 


1761 


3536 


UUAUCUCU CUGAUGAG X CGAA AACCACCA 


517 ; 


TGGTGGTT T AGAGATAA 


1762 


J537 


AUUAUCUC CUGAUGAG X CGAA AAACCACC 


518 "l 


GGTGGTTT A GAGATAAT 


1763 


3543 


UUUUGGAU CUGAUGAG X CGAA AUCUCUAA 


519 : 


TTAGAGA r A ATCCAAAA 


1764 


3546 


UGAUUUUG CUGAUGAG X CGAA AUUAUCUC 


520 ■ 


GAGATAAT C CAAAATCA 


1765 


3553 


CAAACCCU CUGAUGAG X CGAA AUUUUGGA 


521 


TCCAAAAT C AGGGTTTG 


1766 


3559 


CCAAACCA CUGAUGAG X CGAA ACCCUGAU 


522 


ATCAGGGT T TGGTTTGG 


1767 


3560 


CCCAAACC CUGAUGAG X CGAA AACCCUGA 


523 


TCAGGGTT T GGTTTGGG 


1768 
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h:>M j CUUCCCCA CUGAUGAG X CGAA ACCAAACC 



ULUUCCCC CUGAUGAG X CGAA AACCAAAC 



AUUGGGAG CUGAUGAG X CGAA AUUUUCUU 526 



525 



GuriTGGTTTGGGGAAG 



OlilGGTI 'TGGGGAAGA 



AAGAAAAT C CTCCCCCT 



1769 



1770 



177! 



3581 
3587 



GGGGGAGG CUGAUGAG X CGAA AGGGGGAG 



527 
"528" 



AAAA 1 CCJ C CCCCTTCC 



1772 



C I CCCCCT T CCTCCCCC 



3588 
359l 



CUUGGGAG CUGAUGAG X CGAA AAGGGGGA 
UUGCGGGG CUGAUGAG X CGAA AGGAAGGG 



3603 
3604 



CUGUAGGG CUGAUGAG X CGAA ACGGGGCG 



UUGGUAGG CUGAUGAG X CGAA AACGGGGC 



GGAGGCGG CUGAUGAG X CGAA AGGGAACG 



529 
"530" 



531 



532 



533 



TCCCCCii CCTCCCCCG 



CCCllCCTCCCCCGCCC 



CGCCCCGITCCCTACCG 



GCCGCGTT C CCTACCGC 
CGTTCCCT A CCGCCTCC" 



1773 



1774 



1775 



1776 



1777 



1778 



534 



l ACCGCCT C CACTCCTG" 



1779 



3620 
3630 
3633 
3634 
3635 



GCUGGCAG CUGAUGAG X CGAA AGUGGAGG 
AAGGAAAU CUGAUGAG X CGAA AGCUGGCA 
UUUAAGGA CUGAUGAG X CGAA AUGAGCUG 
AUUGAAGG CUGAUGAG X CGAA AAUGAGCU 
AAUUGAAG CUGAUGAG X CGAA AAAUGAGC 



535 



CCTCCACTCCTGCCAGC 
iGCCAGCTCAriTCCTT 
CAUCTCAT T TCCTTCAA 
AGCTCATn CCnCAAT 

ucrrcATTT c cttcaatt" 



GUAAAUUG CUGAUGAG X CGAA AGGAAAUG 



?639 
3643 



AGOAAAUU CUGAUGAG X CGAA AAGGAAAU 



3644 



3645 



3648 



3649 



ULAAAGGA CUGAUGAG X CGAA AUUGAAGG 



OUCAAAGG CUGAUGAG X CGAA AAUUGAAG 



UGUCAAAG CUGAUGAG X CGAA AA AUUGAA 



AUAOGUCJA CUGAUGAG X CGAA AGGAAAUU 



UAUAGGUC CUGAUGAG X CGAA AAGGAAAU 



540 

■54r 



542 



543 



544 



545 



CAriTCCT T CAATTTCC 



AniCCrrCAATTTCCT 



CCTJ CAAT T TCCTTTGX 



CTTCAAITTCCTTTGAC 



TTC/VAin CCriTGACC 



546 



AA ri-I CCT T TGACCTAT" 



AIJICCIITGACCTATA 



1780 

TtST 

1782 

TtsT 



1785 



1786 



1787 



1788 



1789 



1790 



1791 



3662 



3676 



3698 



3699 



UUAUCCUA CUGAUGAG X CGAA AGGUCAAA 
uuuUAGCC CUGAUGAG X CGAA AUAGGUCA 



547 



111 GACCT A TAGGCTAA 
1 UACCTAT A GGCTAAAA 



UUCUUUUU CUGAUGAG X CGAA AGCCUAUA 



549 



i ATAGGCT A AAAAAGAA 



GCUGGAAU CUGAUGAG X CGAA AGCCUUUC 



550 



GAAAGGCT C ATTCCAGC 



UUUGCUGG CUGAUGAG X CGAA AUGAGCCU 



551 



AGGClCAl T CCAGCCAC 



UUUUGCUG CUGAUGAG X CGAA AAUGAGCC 



552 



Gud CAlT C CAGCCACA 



UCCCAGGG CUGAUGAG X CGAA AGGCUGCC 



553 



GGCAGCCT T CCCTGGGC" 



GGGCCAGG CUGAUGAG X CGAA AAGGCUGC 



554 



GCAGCCI l CCCTGGGCCf 



1792 
l79r 



1794 



1795 



1796 



1797 



1798 



1799 
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3709 


GAGAAGCA CUGAUGAG X CGAA AGGCCCAG 


555 


CTGGGCCT T TGCTTCTC 


1800 


3710 


AGAGAAGC CUGAUGAG X CGAA AAGGCCCA 


556':; 


TGGGCCIT T GCTTCTCT 


1801 


3714 


UGCUAGAG CUGAUGAG X CGAA AGCAAAGG 


557r 


CCTTTGCT T CTCTAGCA 


1802 


3715 


GUGCUAGA CUGAUGAG X CGAA AAGCAAAG 


558: 


CTTTGCTT C TCTAGCAC 


1803 


3717 


UUGUGCUA CUGAUGAG X CGAA AGAAGCAA 


559 


TTGC 11 CT C TAGCACAA 


1804 


3719 


AAUUGUGC CUGAUGAG X CGAA AGAGAAGC 


560. 


Gcrrcrcr a gcacaatt 


1805 


J727 


UAACCCAU CUGAUGAG X CGAA AUUGUGCU 


561 - 


AGCACAAT T ATGGGTTA 


1806 


J728 


GUAACCCA CUGAUGAG X CGAA AAUUGUGC 


562 r 


GCACAATT A TGGGTTAC 


1807 


3734 


AAGGAAGU CUGAUGAG X CGAA ACCCAUAA 


563 


TTATGGGT T ACTTCCTT 


1808 


3735 


AAAGGAAG CUGAUGAG X CGAA AACCCAUA 


564 


TATGGGTT A CTTCCTIT 


1809 


3738 


GAAAAAGG CUGAUGAG X CGAA AGUAACCC 


565 


GGGTTACT T CCTTTTTC 


1810 


3739 


AGAAAAAG CUGAUGAG X CGAA AAGUAACC 


566 


GGTTACIT C CTTniCT 


1811 


3742 


UUAAGAAA CUGAUGAG X CGAA AGGAAGUA 


567 


TACll'CCT T TTTCTTAA 


1812 


3743 


GUUAAGAA CUGAUGAG X CGAA AAGGAAGU 


568 


ACTTCCTT T TTCTTAAC 


1813 


3744 


UGUUAAGA CUGAUGAG X CGAA AAAGGAAG 


569 


CTTCCTTT T TCTTAAC A 


1814 


3745 


UUGUUAAG CUGAUGAG X CGAA AAAAGGAA 


570 


TTCCl rn T Cri AACAA 


1815 


3746 


UUUGUUAA CUGAUGAG X CGAA AAAAAGGA 


571 


TCCTTTTT C TTAACAAA 


1816 


3748 


UUUUUGUU CUGAUGAG X CGAA AGAAAAAG 


572 


CTTTTTCT T AAC AAAAA 


1817 


3749 


UUUUUUGU CUGAUGAG X CGAA AAGAAAAA 


573 


i rn i cn a acaaaaaa 


1818 


3764 


GGAAAUCA CUGAUGAG X CGAA ACAUUCUU 


574 


AAGAATGT T TGATTTCC 


1819 


3765 


AGGAAAUC CUGAUGAG X CGAA AACAUUCU 


575 


AGAATGTT T GATTTCCT 


1820 


3769 


CCAGAGGA CUGAUGAG X CGAA AUCAAACA 


576- 


TGTTTGAT T TCCTCTGG 


1821 


3770 


CCCAGAGG CUGAUGAG X CGAA AAUCAAAC 


577 : 


GTTTG ATT T CCTCTGGG 


1822 


3771 


ACCCAGAG CUGAUGAG X CGAA AAAUCAAA 


578 • 


TTTGATTT C CTCTGGGT 


1823 


3774 


GUCACCCA CUGAUGAG X CGAA AGGAAAUC 


579' 


GATTTCCT C TGGGTGAC 


1824 


3785 


A A A ATI t/^ A 1 \/~^ A f^f^ A A A ^ ^ ■ A 

CAGACAAU CUGAUGAG X CGAA AGGUCACC 


580" 


GGTG ACCT T ATTGTCTG 


1825 


3786 


ACAGACAA CUGAUGAG X CGAA AAGGUCAC 


58l;' 


GTGACCTT A TTGTa'GT 


1826 


3788 


UU AC AG AC CUGAUGAG X CGAA AUAAGGUC 


582- 


GACCTTAT J GTCTGTAA 


1827 


3791 


CAAUUACA CUGAUGAG X CGAA ACAAUAAG 


583- 


CTTATTGT C TGTAATTG 


1828 


3795 


GUUUCAAU CUGAUGAG X CGAA ACAGACAA 


584 


TI GTCTGI A ATTGAAAC 


1829 


3798 


AGGGUUUC CUGAUGAG X CGAA AUUACAGA 


585 


TC TGTAAT T GAAACCCT 


1830 
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38( 
38C 


M CCUCUCAA CUGAUGAGXCOAAAGGGUUUC 
>9 CACCUCUCCUGAUGAGXCGAAAUAGGGUIJ 


: 586 
r 587" 


j GAAACCCT A TTGAGAGfC 
AACCCTAT T GAGAGGTG 


J 1 1831 
!j 1832 


pa 

382 


2 cuaacacacugaugagxcgaaacaucacc 


588 


GGTGATGr C TGTGTTAG 


1833 


382 


8 cauuggcucugaugagxcgaaacacagac 


589 


UTCTGTGT'l'AGCCAATG 


1834 


384. 


y ucauuggccugaugagxcgaaaacacaga 


590 


lUlGTGTTAGCCAATGA 


1835 


385: 


> coagcagc cugaugag x cgaa accugggu 


591 


ACCCAGGJ' A GCTGCTCG 


1836 


385? 


J AOAAGCCC CUGAUGAG X cgaa AGCAGCUA 


592 


I AGCTGCT C GGGCTTCT 


1837 


3859 


; accaagag cugaugag x cgaa agcccgag " 


593 


CTCGGGa tctcttggt" 


1838 


3861 


uaccaaga cugaugag X cgaa aagcccga" 


594 


1 CGGGCTT C TC'iTGGTA 


1839 


3863 


CAUACCAA CUGAUGAG X CGAA AGAAGCCG 


595 


GUGCrrCT C ITGGTATG 


1840 


3867 


GACAUACC CUGAUGAG X CGAA AGAGAAGC" 


596 


GCITCTCTTGGTATGTC 


1841 


3871 


ACAAGACA CUGAUGAG X CGAA ACCAAGAG 


597 


CTCTTGGT A TGTCTTGT 


1842 


3873 


CCAAAC AA CUGAUGAG X CGAA ACAU ACCA 


598 


I GGTATGT C TTGTTTGG 


1843 


3876 


UUCCAAAC CUGAUGAG X CGAA AGACAUAC 


599 


GTATGTCT 1 GIITGGAA 


1844 


J877 


GUUUUCCA CUGAUGAG X CGAA ACAAGACA 


600 j 


1 * G rcrrcT t tggaaaag 


1845 


3890 


ACUUUUCC CUGAUGAG X CGAA AACAAGAC 


601 


GJCITG'ITTGGAAAAGT 


1846 


3891 


AUGAAUGA CUGAUGAG X CGAA AUCCACUlT 


602 


AAGTGGAT T TCATTCAT 


1847 


,s« 


AAUGAAUG CUGAUGAG X CGAA AAUCCACU" 


603 1 


AGIGGAITTCATTCATT 


1848 




AAAUGAAU CUGAUGAG X CGAA AAAUCCAC 


604 


GTGGATTT C ATTCATTtH 


1849 


3.95 

3896 


CAGAAAUG CUGAUGAG X CGAA AUGAAAUC " 


605 


G ATTTCAT T CATTTCTG " | 


1850 


3899 


UCAGAAAU CUGAUGAG X CGAA AAUGAAAU 


606 


A HTCATT C ATTTCTGA 


1851 


3900 


CAAUCAGA CUGAUGAG X CGAA AUGAAUGA 


607 


ICAnCATTTCTGATTG 


1852 




ACAAUCAG CUGAUGAG X CGAA AAUGAAUG " 


608 


CAITCATT T CTGATTGT 


1853 


3901 
3906 


GACAAUCA CUGAUGAG X CGAA AAAUGAAU " 


609 


A ITC Am- C TG ATTGTC 


1854 


I3909 


AACUGGAC CUGAUGAG X CGAA AUCAGAAA"" " 


610 


ITTCTGATTGTCCAGTT \ 


1855 


|39M 
3915 


^\j\jt\fw.\j\j ^^uoAUUAu A LGAA ACAAUCAG 


61 i 


C 1 G ATTGT C CAG ITAAG 


1856 


UAUCACUU CUGAUGAG X CGAA ACUGGACA " 


612 


lUTCCAGTTAAGTGATC ' 


1857 


3922 


UGAUCACU CUGAUGAG X CGAA AACUGGAC 


613 


GICCAGTTAAGTGATCA 


1858 I 


3940 


(.LUUUGGU CUGAUGAG X CGAA AUCACUUA 


614 


1 AAGTGAT C ACCAAAGG ' 


1859 


3968 " 


CCCUCCCA CUGAUGAG X CGAA AUUCUCAG " 


615 


c:tgagaat c tgggaggg 


1860 




CACAUAAA CUGAUGAG X CGAA ACUUUUUU "" " 


616 


AAAAAAGT T m ATGTG 


1861 
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83969 


GCACAUAA CUGAUGAG X CGAA AACUUUUU 


617 


AAAAAGTT T TTATGTGC 


1862 


3970 


UGCACAUA CUGAUGAG X CGAA AAACUUUU 


618' 


AAAAGTIT T TATGTGCA 


1863 


3971 


GUGCACAU CUGAUGAG X CGAA AAAACUUU 


619 


AAAGITIT T ATGTGCAC 


1864 


3972 


AGUGCACA CUGAUGAG X CGAA AAAAACUU 


620 


AAGTITTT A TGTGCACT 


1865 


3981 


CCAAAUUU CUGAUGAG X CGAA AGUGCACA 


621 


TGTGCACT T AAATTTGG 


1866 


3982 


CCCAAAUU CUGAUGAG X CGAA AAGUGCAC " 


622 


GTGCACTT A AATTTGGG 


1867 


3986 


UGUCCCCA CUGAUGAG X CGAA AUUUAAGU 


623 - 


ACTTAAAT T TGGGGACA 


1868 


3987 


UUGUCCCC CUGAUGAG X CGAA AAUUUAAG 


624 • 


CTTAAATT T GGGGACAA 


1869 


3997 


AUACAUAA CUGAUGAG X CGAA AUUGUCCC 


625 


GGGACAAT T TTATGTAT 


1870 


3998 


GAUACAUA CUGAUGAG X CGAA AAUUGUCC 


626 


GG ACAATT T TATGTATC 


1871 


3999 


AGAUACAU CUGAUGAG X CGAA AAAUUGUC 


627 


G ACAATTT T ATGTATCT 


1872 


4000 


CAGAUACA CUGAUGAG X CGAA AAAAUUGU 


628 


ACAATTTT A TGTATCTG 


1873 


4004 


AACACAGA CUGAUGAG X CGAA ACAUAAAA 


629 


nn ATGT A TCTGTGTT 


1874 


4006 


UUAACACA CUGAUGAG X CGAA AUACAUAA 


630 


TTATGTAT C TGTGTTAA 


1875 


4012 


AUAUCCUU CUGAUGAG X CGAA ACACAGAU 


631 


ATCTGTGT T AAGGATAT 


1876 


4013 


CAUAUCCU CUGAUGAG X CGAA AACACAGA 


632 


TCTGTGTT A AGGATATG 


1877 


4019 


CUUAAGCA CUGAUGAG X CGAA AUCCUUAA 


633 


TTAAGGAT A TGCTTAAG 


1878 


4024 


AUGUUCUU CUGAUGAG X CGAA AGCAUAUC 


634 


G ATATGCT T AAGAACAT 


1879 


4025 


UAUGUUCU CUGAUGAG X CGAA AAGCAUAU 


635 


ATATGCTT A AGAACATA 


1880 


4033 


AAAAGAAU CUGAUGAG X CGAA AUGUUCUU 


636 


AAGAACAT A ATTCTTTT 


1881 


4036 


AACAAAAG CUGAUGAG X CGAA AUUAUGUU 


637 


AAC ATAAT T CTTTTGTT 


1882 


4037 


CAACAAAA CUGAUGAG X CGAA AAUUAUGU 


638 


AC ATAATT C TTTTGTTG 


1883 


4039 


AGCAACAA CUGAUGAG X CGAA AGAAUUAU 


639 


ATAATTCT T TTGTTGCT 


1884 


4040 


CAGCAACA CUGAUGAG X CGAA AAGAAUUA 


640 


TAATTCTT T TGTTGCTG 


1885 


4041 


ACAGCAAC CUGAUGAG X CGAA AAAGAAUU 


641 < 


AATTCTTT T GTTGCTGT 


1886 


4044 


CAAACAGC CUGAUGAG X CGAA ACAAAAGA 


642 


TCTTTTGT T GCTGTTTG 


1887 


4050 


CUUAAACA CUGAUGAG X CGAA ACAGCAAC 


643 • 


GTI GCTG T r TGTTTAAG 


1888 


4051 


UCUUAAAC CUGAUGAG X CGAA AACAGCAA 


644 ■ 


TTGCTGTT T GTTTAAGA 


1889 


4054 


GCUUCUUA CUGAUGAG X CGAA ACAAACAG 


645 


CTGTTTGT T TAAG AAGC 


1890 


4055 


UGCUUCUU CUGAUGAG X CGAA AACAAACA 


646 


TGTTTGTT T AAG AAGCA 


1891 


4056 


GUGCUUCU CUGAUGAG X CGAA AAACAAAC 


647 


GTTTGTTT A AG AAGCAC 


1892 



wo 99/54459 



PCT/US99/08547 



78 



AACAAACucuUAUGA GACUAAA UCiUGCUUl 648 | AAGCACCrf AG^^ 
AAACAAAC CUGAUGAG X CGAA AAUGUGCU 649 



CUUAAACA CUGAUGAG XCGAAACUAAGGU 650 



UCUUAAAC CUGAUGAG X CGAA AACUAAGG 65 J 



O^UUCUUA CUGAUGAG X CGAA ACAAACUA" 



AGCAccn AGrrTGTfr 



ACLJiAGl rrGTTTAAG 



CcilAGITTGTTrAAGA 



652 



4093 



4096 



4098 
4ToT 



4105 
4107 
4108 



4110 



4I1I 



4112 



4130 



4132 



UUCUUCUU CUG AUGAG X CGAA AACAAACU -65r 
eiUUCUUCU CU GAUGAG X CGAA AAACAAAC I 654 
UACUAUAU CUGAUGAG X CGAA AGGUGCUU "l 655 
AUACUAUA CUGAUGAG X CGAA AAGGUGCU 656 
UUAUACUA CUGAUGAG X CGAA AUAAGGUG | 657" 
UAUUAUAC CUGAUGAG X CGAA . 



1 AG riTGT T TAAGAAGC 



AUAUAAGGl SsT 

AUAUAUUA CUGAUGAG X CGAA ACUAUAUA 659 

AUAUAUAU CUGAUGAG X CGAA AUACUAUA 660 

AAAAUAUA CUGAUGAG X CGAA AUUAUACU 661" 



AAAAAAUA CUGAUGAG X CGAA AUAUUAUA 662 
CAAAAAAA CUGAUGAG X CGAA AUAUAUUA 663 



AGlllGJJ I AAGAAGCA" 
OlilGTIT AAGAAGCAC 



AaucaCCT T ATATAGTA 
AUCACCTT A TATAGTAT 
CACCri AT A TAGTATAA 



CCJT A'l'AT A GTATAATA 



lAJATAGTATAATATAT 
lAlAGTATAATATATAT 



AU l A TAAT A TATATTTT 



uucAAAAA CUGAUGAG X CGAA AUAUAUAU 664 
UUUCAAAA CUGAUGAG X CGAA AAUAUAUA 665 " 



AUUUCAAA CUGAUGAG X CGAA AAAUAUAU 666 



AAUUUCAA CUGAUGAG X CGAA AAAAUAUA 667 



UAAUUUCA CUGAUGAG X CGAA AAAAAUAU 668" 



UUAAUUUC CUGAUGAG X CGAA AAAAAAUA 669 



AUCAAUGU CUGAUGAG X CGAA AUUUCAAA 670 ' 



AAGCAAUG CUGAUGAG X CGAA AAUUUCAA 671 " 
AAACAAGC CUGAUGAG X CGAA AUGUAAUU 672 



UUAUAAAC CUGAUGAG X CGAA AGCAAUGU 673 
UUCUGAUA CU GAUGAG X CGAA ACAAGCAA 674 



4134 



4142 



uuucUGAU CU GAUGAG X CGAA AACAAGCA 675 
UUUUCUGA CUGAUGAG X CGAA AAACAAGC 676" 



AAUUGUCU CUGAUGAG X CGAA AUAAACAA 677 



CUACAUUC CUGAUGAG X CGAA AUUGUCUG 678 



rAlAATATATAlTrrrf 



1895 



1896 



1897 



1898 



1899 



1 AA I ATAT A rTTTTTTG" 
AlArAlATTrmTGAA 



lATATATTTrrrrGAAA 



AIAIATTTTITTGAAAT 



lAlAiriTTITGAAATT 
A 1 ATTTTT r TGAAATTA 



I A l l rrrr t g aaattac" 



JllUAAATTACATTGCT 



1 rUAAATT A CATTGCTT" 



AA I j ACAT T GCTTGTTT ■ 



ACArrGCT t gtitatca" 



iiGciiurr tatcagac- 



ioliigh tatcagaca 



CiC n OTTTA TCAGACAA" 



MUITTATCAGACAATT 
CAGACAAT T GAATG TAG 



1900 

T9or 



1903 



1904 
T90r 



1906 



1907 



1908 



1909 



1910 



1911 



1912 



1913 
"1914" 



1915 



1916 



1917 



1918 



1919 



1920 



1921 



1922 
T92r 
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4149 


AGAAUUAC CUGAUGAG X CGAA ACAUUCAA 


679 


TTGAATGT A GTAATTCT 


1924 


4152 


AACAGAAU CUGAUGAG X CGAA ACUACAUU 


6807 


AATGTAGT A ATTCTGTT 


1925 


4155 


CAGAACAG CUGAUGAG X CGAA AUUACUAC 


68 r 


GTAGTAAT T CTGTTCTG 


1926 


4156 


CCAGAACA CUGAUGAG X CGAA AAUU ACUA 


682' 


TAGTAATT C TGTTCTGG 


1927 


4160 


AAAUCCAG CUGAUGAG X CGAA ACAGAAUU 


683 


AATTCTGT T CTGGATTT 


1928 


4161 


UAAAUCCA CUGAUGAG X CGAA AACAGAAU 


684' 


ATTCTGTT C TGG ATTTA 


1929 


4167 


UCAAAUUA CUGAUGAG X CGAA AUCCAGAA 


685 


TTCTGGAT T TAATTTGA 


1930 


4168 


GUCAAAUU CUGAUGAG X CGAA AAUCCAGA 


686 


TCTGGATT T AATTTGAC 


1931 


4169 


AGUCAAAU CUGAUGAG X CGAA AAAUCCAG 


687 


CTGGATTT A A7TTGACT 


1932 


4172 


CCCAGUCA CUGAUGAG X CGAA AUUAAAUC 


688 


GAITTAAT T TGACTGGG 


1933 


4173 


ACCCAGUC CUGAUGAG X CGAA AAUUAAAU 


689 


ATTTAATT T GACTGGGT 


1934 


4182 


UGCAUGUU CUGAUGAG X CGAA ACCCAGUC 


690 


GACTGGGT T AACATGCA 


1935 


4)83 


UUGCAUOU CUGAUGAG X CGAA AACCCAGU 


691 


ACTGGGTT A ACATGCAA 


1936 


4207 


AAACUAAA CUGAUGAG X CGAA AUUUUUCC 


692 


GGAAAAAT A TTTAGTTT 


1937 


4209 


AAAAACUA CUGAUGAG X CGAA AU AUUUUU 


693 


AAAAATAT T TAGTTTTT 


1938 


4210 


AAAAAACU CUGAUGAG X CGAA AAUAUUUU 


694 


AAAA TAIT T AGTTTTTT 


1939 


4211 


AAAAAAAC CUGAUGAG X CGAA AAAUAUUU 


695 


AAATATTT A GTTTTTTT 


1940 


4214 


AAAAAAAA CUGAUGAG X CGAA ACUAAAUA 


696 


TATTTAGT T Tl lTiTn 


1941 


4215 


AAAAAAAA CUGAUGAG X CGAA AACUAAAU 


697 


ATITAGTT T rmTTTT 


1942 


4216 


AAAAAAAA CUGAUGAG X CGAA AAACUAAA 


698 


TTTAGTTT T TTTTTTTT 


1943 


4217 


AAAAAAAA CUGAUGAG X CGAA AAAACUAA 


699 


TTAGTi rn TlTT l'lTT 


1944 


4218 


AAAAAAAA CUGAUGAG X CGAA AAAAACUA 


700^ 


TAGTTTTT TTiTTliri 


1945 


4219 


AAAAAAAA CUGAUGAG X CGAA AAAAAACU 


70r 


AGTTTJTi 1' irrrnTT 


1946 


4220 


AAAAAAAA CUGAUGAG X CGAA AAAAAAAC 


702 ■ 




1947 


4221 


AAAAAAAA CUGAUGAG X CGAA AAAAAAAA 


703*' 


rrnrrn 1 "iTiTnTT 


1948 


4222 


CAAAAAAA CUGAUGAG X CGAA AAAAAAAA 


704-' 




1949 


4223 


ACAAAAAA CUGAUGAG X CGAA AAAAAAAA 


705 , ' 


rnT'iTn t ttttttgt 


1950 


4224 


UACAAAAA CUGAUGAG X CGAA AAAAAAAA 


706^ 


irirrrn'TiTrrrGrA 


1951 


4225 


AUACAAAA CUGAUGAG X CGAA AAAAAAAA 


707 


TTTTTTTT T TTTTGTA T 


1952 


4226 


UAUACAAA CUGAUGAG X CGAA AAAAAAAA 


708 


ITTTTTTT T TTTGTATA 


1953 


4227 


GUAUACAA CUGAUGAG X CGAA AAAAAAAA 


709 


TTTTTTTT T 1TGTATAC 


1954 
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14228 

4229 



4232 
4234 



4237 
4238 



4239 
4240 



AUUAUACA CUGAUGAG X CGAA AAAAAAAA 
AAUUAUAC CUGAUGAG X CGAA AAAAAAAA 



GaaaAGUA CUGAUGAG X CGAA ACAAAAAA 



UUUAAAAG CUGAUGAG X CGAA AUACAAAA 



AUC'UUGAA CUGAUGAG X CGAA AGUAUACA 



UAUCUUGA CUGAUGAG X CGAA AAGUAUAC 



UUAGCUUG CUGAUOAO X CGAA AAAGUAUA 



GuuAOCUU CUGAUGAG X CGAA AAAAGUAU 




4268 



4270 



4276 



4289 



4290 



4292 



4296 



4297 



UUACUUUA CUGAUGAG X CGAA ACAUGACA 
AAUGACUG CUGAUGAG X CGAA AUACAUGA 
UCAUAAAU CUGAUGAG X CGAA ACUGUAUA 



UAUGCAUA CUGAUGAG X CGAA AUGACUGU 



UUAUCiCAU CU GAUGAG X CGAA AAUGACUG " 



UUUAUUCA CU GAUGAG X CGAA AAAUGACU 



^^AUUCUU CU GAUGAG X CGAA AGGCAUAA 



710 



712 

"tiT 



721 



724 

72r 



TTTTTTTT T TGrATACT 
Tiri-J'i-ITTGTATACTT 



TTTTTTGT A TACTTTIC 



miGlATACTTTTCAA 
iGlAJACi J ITCAAGCT 



lUICATGTATACAGTCA 
ICATGTATACAGTCATT 
I AFACAGT C ATTTATGC 



ACAGlCATTTATGCCr: 



CAGICATI TATGCCTAA 



726 



AAAUGAAU CUGAUGAG X CGAA AUCACCAG 



UAAAUGAA CUGAUGAG X CGAA AAUCACCA 



UUUAAAUG CUGAUGAG X CGAA AUAAUCAC 



AUUUAAAU CUGAUGAG X CGAA AAUAAUCA 



UUCAUUUA CUGAUGAG X CGAA AUGAAUAA 



CUUCAUUU CUGAUGAG X CGAA AAUGAAUA 



727 



728 



729 



730 



731 



732 



733 



AG 1 CATTT A TGCCTAAA 



TTATGCCT A AAGCCTGG 



CTGGTGAT T ATTCATTT 



TGGlGAl i ATTCATTTA 



GTGATTAT T CAITTAAA 



IGAllAlTCATTTAAAf 



riAIlCATTTAAATGAA 



TAirCATTTAAATGAAG 



1955 



1956 



1957 



1958 
T959" 



1966 
7967*" 
T96r 



1969 



1970 



1971 



1972 



1973 



1974 



1975 



1976 



1977 



1978 



UCUUCAUU CUGAUGAG X CGAA AAAUGAAU 



734 



AllCATiTAAATGAAGA 



4308 
4313 



UGAAAUGU CUGAUGAG X CGAA AUCUUCAU 



735 



ATGAAGAT C ACAT7TCA 



UUAUAUGA CUGAUGAG X CGAA AUGUGAUC 



736 



GAIGACATTTCATATCA 



UuuAUAUG CUGAUGAG X CGAA AAUGUGAU 



4315 



uuuuAUAU CUGAUGAG X CGAA AAAUGUGA 



737 

"73F 



AIGACATTTCATATCAA 



TCACAMTCATATCAAC 



4318 



AAAOUUUA CUGAUGAG X CGAA AUGAAAUG 



739 



CAinCATATCAACTTT 



CAAAAGUU CUGAUGAG X CGAA AUAUGAAA 



740 



TTTCAIAJ CAACTTTTG 



1979 



1981 



1982 



1983 



1984 
■T98r 
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[4325 


GGAUACAA CUGAUGAG X CGAA AGUUGAUA 


741 


TATCAACT T TTGTATCC 


1986 1 


4326 


UGGAUACA CUGAUGAG X CGAA AAGUUGAU 


742" 


ATCAACTT T TGTATCCA 


1987 


4327 


GUGGAUAC CUGAUGAG X CGAA AAAGUUGA 


743- 


TCAACTTT T GTATCCAC 


1988 


4330 


ACUGUGGA CUGAUGAG X CGAA ACAAAAGU 


744; 


ACTTTTGT A TCCACAGT 


1989 


4332 


CUACUGUG CUGAUGAG X CGAA AUACAAAA 


745' 


TTTTGTAT C CACAGTAG 


1990 


4339 


AUUUUGUC CUGAUGAG X CGAA ACUGUGGA 


746- 


TCCACAGT A GACAAAAT 


1991 


4348 


AUUAGUGC CUGAUGAG X CGAA AUUUUGUC 


747 


GACAAAAT A GCACTAAT 


1992 


4354 


AUCUGGAU CUGAUGAG X CGAA AGUGCUAU 


748 


ATAGCACT A ATCCAGAT 


1993 


4357 


GGCAUCUG CUGAUGAG X CGAA AUUAGUGC 


749 


GCACTAAT C CAGATGCC 


1994 


4367 


UCCAACAA CUGAUGAG X CGAA AGGCAUCU 


750 


AGATGCCT A TTGTTGGA 


1995 


4369 


UAUCCAAC CUGAUGAG X CGAA AUAGGCAU 


751 


ATGCCTAT T GTTGGATA 


19% 


4372 


CAAUAUCC CUGAUGAG X CGAA ACAAUAGG 


752 


CCTATTGT T GGATATTG 


1997 


4377 


UCAUUCAA CUGAUGAG X CGAA AUCCAACA 


753 


TGTTGG AT A TTGAATG A 


1998 


4379 


UGUCAUUC CUGAUGAG X CGAA AUAUCCAA 


754 


TTGGATAT T GAATGAC A 


1999 


4394 


CUACAUAA CUGAUGAG X CGAA AUUGUCUG 


755 


CAGACAAT C TTATGTAG 


2000 


4396 


UGCUACAU CUGAUGAG X CGAA AGAUUGUC 


756 


GACAATCT T ATGTAGCA 


2001 


4397 


UUGCUACA CUGAUGAG X CGAA AAGAUUGU 


757 


ACAATCTT A TGTAGCAA 


2002 


4401 


AUCUUUGC CUGAUGAG X CGAA ACAUAAGA 


758 


TCTTATGT A GCAAAGAT 


2003 


4410 


UCAGGCAU CUGAUGAG X CGAA AUCUUUGC 


759 


GCAAAGAT T ATGCCTGA 


2004 


4411 


UUCAGGCA CUGAUGAG X CGAA AAIJCUUUG 


760 


CAAAGATT A TGCCTGAA 


2005 


4429 


CCCUGAAU CUGAUGAG X CGAA AUUUUCCU 


761 


AGGAAAAT T ATTCAGGG 


2006 


4430 


GCCCUGAA CUGAUGAG X CGAA AAUUUUCC 


762'- 


GGAAAATF A TTCAGGGC 


2007 


4432 


CUGCCCUG CUGAUGAG X CGAA AUAAUUUU 


763 


AAAATTAT T CAGGGCAG 


2008 


4433 


GCUGCCCU CUGAUGAG X CGAA AAUAAUUU 


764 


AAATTATT C AGGGCAGC 


2009 


4443 


AGCAAAAU CUGAUGAG X CGAA AGCUGCCC 


765^ 


GGGCAGCT A ATTTTGCT 


2010 


4446 


AAAAGCAA CUGAUGAG X CGAA AUUAGCUG 


766; 


CAGCTAAT T TTGCTm" 


2011 


4447 


UAAAAGCA CUGAUGAG X CGAA AAUUAGCU 


767. 


AGCTAATT T TGCTITTA 


2012 


4448 


GUAAAAGC CUGAUGAG X CGAA AAAUUAGC 


768' 


GCTAATTT T GCITI I AC 


2013 1 


4452 


UUUGGUAA CUGAUGAG X CGAA AGCAAAAU 


769 


ATTTTGCT T TTACCAAA 


2014 


4453 


UUUUGGUA CUGAUGAG X CGAA AAGCAAAA 


770 


TTTTGCTT T TACCAAAA 


2015 


4454 


AUUUUGGU CUGAUGAG X CGAA AAAGCAAA 


771 


TTTGCTTT T ACCAAAAT 


2016 



wo 99/54459 



PCT/US99/08547 



82 



10 



15 



20 



25 



30 



-.^Jj UAUUUUGGCUUAUGAGXCGAAAAAAGCAA 



h'^ts I ACUACUGA CUliAUGAG X CUAA AUUUUGGU 
UUACUACU CUGAUGAO X CUAA AUAUUUUG 



p469 



P472 



p475 



4477 



AAUAUUAC CUUAUOAG X CGAA ACUGAUAU 



772 

"ttT 



774 

"TtT 



AAAAAUAU CUUAUUAG X CGAA ACUACUGA" ^76 



UCL.AAAAA CUUAUUAG X CGAA AUUACUAC 777 



ilUCI-ITrACCAAAATA I 2017" 
ACCAAAAT A 1 LAG 1 AGT 2018 " 



CAAAATAT C AOTAGTAA"] — 2019" 



AlATCAGTAGTAATATT I — 2020 
i«-AUiAUJ AAlAIUil 2021 



4478 
4479 
4480 
^488 
4492 
4499 



'4510 
45Ti 
45T2 



uuuuLAAA CUG AUGAO X CGAA AUAUUACU 778 - 



cuuucCAA CUG AUGAO X CGAA AAUAUUAC 779' 

ACUOUCCA CUOAUOAG X CGAA AAAUAUUA 780 

UACUOUCC CU GAUGAO X CGAA AAAAUAUU 781 

^CAUUAUC CUOAUOAG X CGAA AtUGUCCA 782 ' 

uuACUCAU CUG AUOAG X CGAA AOCUACUO | 783 
AACCCACU CUOAUOAG X CGAA ACCCAUUA 



OlAOrAATAllillOOA I 2022 
AOiAAiATTTilOUACA | 2023 



OlAAIATTTTiOOACAG I 2024" 
i AAl A I'lT T TGGACAGT ] — TOTT 
AAl AITIT T GGACAGTA" 



2026 




4513 



UUAAAAAG CUOAUOAG X CGAA ACCCACUG" 
AUUAAAAA CUOAUOAG X CGAA AACCCACU 
AUAUUAAA CUOAUOAG X COAA AGAACCCA" 
AACAUUAA CUOAUOAG X CGAA AAGAACCC 
AAACAUUA CUOAUOAG X CGAA AAAGAACC 



lUOACAOTAt^CJlAAlGG | 2027 
«-AU l AOCT A ATGGGTCa" 



2028 



TAATGGGT C AGTGOGTT i — 2029" 



4514 



I45I9 



4523 



4526 



UAAACAUU CUOAUOAG X CGAA AAAAGAAC " 



AUAAACAU CUOAUOAG X CGAA AAAAAOAA 791 



UAAOUAUA CUOAUOAG X CGAA ACAUUAAA 792 " 
«-UAAOUAU CUGAUGAO X CGAA AACAUUAA 793 " 



ucuAAOUA CUOAUOAG X CGAA AAACAUUA 794 " 



AAUUUAAO CUOAUOAG X CGAA AUAAACAU 795 ' 
UAAAAUCU CUGAUGAO X CGAA AOUAUAAA 796 



4527 



4531 



4532 



AUAAAAUC CUOAUOAG X CGAA AAGUAUAA I 797" 
UAAAAOAA CUOAUOAG X CGAA AUCUAAGU 798 



K536 



uuAAAAOA CUOAUOAG X CGAA AAUCUAAG 799 
UUUAAAAG CUOAUOAG X CGAA AAAUCUAA 800 



UUUUAAAA CUOAUOAG X CGAA AAAAUCUA 
UUUUUUAA CUOAUOAG X CGAA AGAAAAUC 



CAOTGGGT 1 CriTTTAA 
AUIOOGTTC ITTTTAAT 
lUOOITCTT i ri AATGT 
uuol I CTT T ITAATOTT" 
GGTTmTTTAATGfiT 
OIlCrnTTAATGTTTA 



TTciim AATGTTTAT I — 2036" 
IIIAATGTTTAIACITA 2037 
HAAioiTTAiACIlAO I 2038" 
1 AA I u I ri A I ACTTAGA 1 — 2039" 



AloriTATACTTAGATT 



2040 



IITArACTTAGATTTrc 



2041 



llAIACTlAGAirilCT 2042 



ACHAGATT TTCnTlA I 204r 

CUAOA-lTTTCilUAA | 2044" 



801 

~m2 



llAOAl lTTCTlllAAA | 2045" 
lAOAiTITCnilAAAA i 2046 



UAiuid-niAAAAAA 2047 
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14537 


UUUUUUUA CUGAUGAG X CGAA AAGAAAAU 


803 


ATTTTCTT T TAAAA AA A 


2048 


4538 


AUUUUUUU CUGAUGAG X CGAA AAAGAAAA 


804 


TTTTCTTT T AAAAAAAT 


2049 


4539 


AAUUUUUU CUGAUGAG X CGAA AAAAGAAA 


805: 


rri CTTIT A AAAAAATT 


2050 


4547 


UUUAUUUU CUGAUGAG X CGAA AUUUUUUU 


806: 


AAAAAAAT T AAAATAAA 


2051 


4548 


UUUUAUUU CUGAUGAG X CGAA AAUUUUUU 


807 


AAAAAATT A AAATAAAA 


2052 


4553 


UUUUGUUU CUGAUGAG X CGAA AUUUUAAU 


808 


ATTAAAAT A AAACAAAA 


2053 


4567 


GUCCUAGA CUGAUGAG X CGAA AUUUUUUU 


809". 


AAAAAAAT T TCTAGGAC 


2054 


4568 


AGUCCUAG CUGAUGAG X CGAA AAUUUUUU 


810 


AAAAAATT T CTAGGACT 


2055 


4569 


UAGUCCUA CUGAUGAG X CGAA AAAUUUUU 


811 


AAAAATTT C TAGGACTA 


2056 


4571 


UCUAGUCC CUGAUGAG X CGAA AGAAAUUU 


812 


AAATTTCT A GGACTAGA 


2057 


4577 


ACAUCGUC CUGAUGAG X CGAA AGUCCUAG 


813 


CTAGGACT A GACGATGT 


2058 


4586 


GCUGGUAU CUGAUGAG X CGAA ACAUCGUC 


814 


GACGATGT A ATACCAGC 


2059 


4589 


UUAGCUGG CUGAUGAG X CGAA AUUACAUC 


815 


GATGTAAT A CCAGCTAA 


2060 


4596 


UUUGGCUU CUGAUGAG X CGAA AGCUGGUA 


816 


TACCAGCT A AAGCCAAA 


2061 


4609 


CACUGUAU CUGAUGAG X CGAA AUUGUUUG 


817 


CAAACAAT T ATACAGTG 


2062 


4610 


CCACUGUA CUGAUGAG X CGAA AAUUGUUU 


818 


AAACAATT A TACAGTGG 


2063 


4612 


UUCCACUG CUGAUGAG X CGAA AUAAUUGU 


819 


ACAATTAT A CAGTGG AA 


2064 


4624 


UAAUGUAA CUGAUGAG X CGAA ACCUUCCA 


820 


TGG AAGGT T TTACATTA 


2065 


4625 


AUAAUGUA CUGAUGAG X CGAA AACCUUCC 


821 


GGAAGGTT T TAG ATTAT 


2066 


4626 


AAUAAUGU CUGAUGAG X CGAA AAACCUUC 


822 


GAAGGTTT T ACATTATT 


2067 


4627 


GAAUAAUG CUGAUGAG X CGAA AAAACCUU 


823 


AAGGTTTT A CATTATTC 


2068 


4631 


GGAUGAAU CUGAUGAG X CGAA AUGUAAAA 


824 


TTTTACAT T ATTCATCC 


2069 


4632 


UGGAUGAA CUGAUGAG X CGAA AAUGUAAA 


825 


riTACATT A rrCATCCA 


2070 


4634 


AUUGGAUG CUGAUGAG X CGAA AUAAUGUA 


826 


TACATTAT T CATCCA AT 


2071 


4635 


CAUUGGAU CUGAUGAG X CGAA AAUAAUGU 


827 


ACATTATT C ATCC AATG 


2072 


4638 


ACACAUUG CUGAUGAG X CGAA AUGAAUAA 


828 


TTATTCAT C CAA TGTGT 


2073 


4647 


UGAAUAGA CUGAUGAG X CGAA ACACAUUG 


829 


CAATGTGT T TCTATTCA 


2074 


4648 


AUGAAUAG CUGAUGAG X CGAA AACACAUU 


830 


AATGTGTT T CTATTCAT 


2075 ' 


4649 


CAUGAAUA CUGAUGAG X CGAA AAACACAU 


831 


ATGTGTTT C TATTCATG 


2076 


4651 


AACAUGAA CUGAUGAG X CGAA AGAAACAC 


832 


GTGTTTCr A TTCATGTT 


2077 


4653 


UUAACAUG CUGAUGAG X CGAA AUAGAAAC 


833 


GTTTCTAT T CATGTTAA 


2078 
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f 1 1 r ~ 1 rill s==>--— 


p^., ^uurt/^t,Aui,uUAUOAQXCGAAAAUAGAAi 


^l 834" 


TlTCTATT ("A TrTT a Ah 
»in-i/Mi\^A1011 AAG 


2079 


-ojy AUUAUCUU CUGAUGAG X CGAA ACAUGAAl 


i 835 


ATTCATGT T A An ATA r^T 


2080 


4660 UAUUAUCU CUGAUGAG X CGAA AACAUG/V 


i 836 


*■ * >^r\ 1 VJ 1 1 rt. /\OA 1 AC 1 A 


2081 


-ooo UUUAOUAG CUGAUGAG X CGAA AUCUUAAC 


: 837 


GTTAAGAT A rTAnrTr^ 


2082 


4668 AAAUUUAG CUGAUGAG X CGAA AGUAUCUU 


838 


ACIACAJTT 


2083 


4671 UUCaaaUG CUGAUGAG X CGAA AGUAGUAU 


839 


AT ACT A CT A rATTTnAlT 


2084 


4675 CCACUUCA CUGAUGAG X CGAA AUGUAGUA 


840 


TACTAPAT T TriA a r-rnr^ 
1 \ t\\^f\ 111 o AACj 1 CjO 


2085 


46/6 CCCACUUC CUGAUGAG X CGAA AAUGUAGU 


841 


ACTACAIT T GAAGTGGG 


2086 


|46y5 AAUCaulu CUGAUGAG X CGAA AUOUUCUC 


842 


A 1 c A(jA 1 GATT 


2087 


4703 AACAUUUC CUGAUGAG X CGAA AUCAUCUG 


843 


v^rtu/iioAJ 1 OAAATGTr 


2088 


471 1 CCUGUUCU CUGAUGAG X CGAA ACAUUUCA 


844 


ivj/wviui 1 CUCCCAGG 


2089 


4712 CCCUUUGC CUGAUGAG X CGAA AACAUUUC 


845 


\3t^j\i\ I u 1 1 c UCCcAGGG 


2090 


4723 UUGCUGGA CUGAUGAG X CGAA ACCCCUGG 


846 


v^v^/ivjuuuJ c ICCAGCAA 


2091 


4/25 AGUUGCUG CUGAUGAG X CGAA AGACCCCU 
|4/34 GAUUUCGA CUGAUGAG X CGAA AGUUGCUG 


847 
848 


Auouu 1 c 1 C CAGCAACT 
v,rt.vjiw/vAC 111 CjOAAATC 


2092 1 

2093 1 


4733 AGAUUUCC CUGAUGAG X CGAA AAGUUGCU 


849 


/v'jv^/i/ii^ 1 J 1 uOAAATCT 


2094 


4742 UACAAAGA CUGAUGAG X CGAA AUUUCCAA 


850 


TTGGA A AT O ^^n^rv^n-vl — 


2095 


4/44 AAUACAAA CUGAUGAG X CGAA AGAUUUCC 


851 


GGAAATPT C TTTr'TATT- 


2096 


4/46 AAAAUala CUGAUGAG X CGAA AGAGAUUU 


852 


AAATCTPT T THTATTtt 


2097 


474 / AAAAAUAC CUGAUGAG X CGAA AAGAGAUU ' 


853 


AATCTCTT T GTATTTTT 


2098 


4750 AOUAAaaa CUGAUGAG X CGAA ACAAAGAO 


854 


CTCTTTGT ATTTTTArr " 


2099 


4752 CAAGUAAA CUGAUGAG X CGAA AUACAAAG " 


855 


CTTTGTAT T TTTArrrr. " 


2100 


4 /53 UCAAGUAA CUGAUGAG X CGAA AAUACAAA " 


856 


ITTGl A'l 1 I \\ Arw€\A — 


2101 


4754 UUCAAGUA CUGAUGAG X CGAA AAAUAC/VA " 


857 


rruTA'rrr t TAcwr a a — 


2102 


4/55 CUUCAAGU CUGAUGAG X CGAA AAAAUACA " 


858 


TUTAnTTTACrrGAAG" " 


2103 


4/56 ACUUCAAG CUGAUGAG X CGAA AAAAAUAC ' 


859 


U 1 ATTTTT A CTTGAAGT 


2104 


4759 GGCACUUC CUGAUGAG X CGAA AGUAAAAA ~ 


860 


11 rMACTTGAAGTGCC " 


2105 1 


4771 CUGUCCAU CUGAUGAG X CGAA AGUGGCAC ~ 


861 


U 1 CiCCACT A A'l'GGACAG " 


2106 


<* /«5 CCAGAaaa CUGAUGAG X CGAA AUCUGCUG 


862 


CACiCAGAT A TTTTCTGG " 


2107 


4 /87 AGCCAGAA CUGAUGAG X CGAA AUAUCUGC ~ 


863 


UCAGATAT T TTCTGGCT ' 


2108 


4 788 CAGCCAGA CUGAUGAG X CGAA AAUAUCUO 


864 


CAUAIATTTTCTGGCTG 


2109 
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4789 


UCAGCCAG CUGAUGAG X CGAA AAAUAUCU 


865 


AGATATTT T CTGGCTGA 


2110 


4790 


AUCAGCCA CUGAUGAG X CGAA AAAAUAUC 


866 


G ATATTTT C TGGCTGAT 


2111 


4801 
|4805 


CCAAUACC CUGAUGAG X CGAA ACAUCAGC 


867 

St- 


GCTGATGT T GGTATTGG 


2112 


ACACCCAA CUGAUGAG X CGAA ACCAACAU 


868 


ATGTTGGT A TTGGGTGT 


2113 


4807 


CUACACCC CUGAUGAG X CGAA AUACCAAC 


869 


GTTGGTAT T GGGTGTAG 


2114 


4814 


CAUGUUCC CUGAUGAG X CGAA ACACCCAA 


870 


TTGGGTGT A GGAACATG 


2115 


4825 


UUUUUUUA CUGAUGAG X CGAA AUCAUGUU 


871 


AACATG AT T TAAAAAAA 


2116 


4826 


UUUUUUUU CUGAUGAG X CGAA AAUCAUGU 


872 


AC ATGATT T AAAAAAAA 


2117 


4827 


UUUUUUUU CUGAUGAG X CGAA AAAUCAUG 


873 


CATGATTT A AAAAAAAA 


2118 


4839 


AGAGGCAA CUGAUGAG X CGAA AGUUUUUU 


874 


AAAAAACT C TTGCCTCT 


2119 


4841 


GCAGAGGC CUGAUGAG X CGAA AGAGUUUU 


875 


AAAACTCT T GCCTCTGC 


2120 


4846 


GGAAAGCA CUGAUGAG X CGAA AGGCAAGA 


876 


TCTTGCCT C TGCTTTCC 


2121 


4851 


GUGGGGGA CUGAUGAG X CGAA AGCAGAGG 


877 


CCTCTGCT T TCCCCCAC 


2122 


4852 


AGUGGGGG CUGAUGAG X CGAA AAGCAGAG 


878 


CTCTGCTT T CCCCCACT 


2123 


4853 


GAGUGGGG CUGAUGAG X CGAA AAAGCAGA 


879 


TCTGCTTT C CCCCACTC 


2124 


4861 


UUGCCUCA CUGAUGAG X CGAA AGUGGGGG 


880 


CCCCCACT C TGAGGCAA 


2125 


4872 


UACAUUUU CUGAUGAG X CGAA ACUUGCCU 


881 


AGGCAAGT T AAAATGTA 


2126- 


4873 


UUACAUUU CUGAUGAG X CGAA AACUUGCC 


882 


GGCAAGTT A AAATGTAA 


2l27v 


4880 


ACAUCUUU CUGAUGAG X CGAA ACAUUUUA 


883 


TAAAATGT A AAAGATGT 


2128. 


4892 


CCCAGAUA CUGAUGAG X CGAA AUCACAUC 


884 


GATGTGAT T TATCTGGG 


2129 


4893 


CCCCAGAU CUGAUGAG X CGAA AAUCACAU 


885 


ATGTGATT T ATCTGGGG 


2130 


4894 


CCCCCAGA CUGAUGAG X CGAA AAAUCACA 


886 


TGTGATTT A TCTGGGGG 


2131 


4896 


GCCCCCCA CUGAUGAG X CGAA AUAAAUCA 


887 


TGATTTAT C TGGGGGGC 


2132 


4906 


CCAUACCU CUGAUGAG X CGAA AGCCCCCC 


888 


GGGGGGCT C AGGTATGG 


2133 


4911 


CCCCACCA CUGAUGAG X CGAA ACCUGAGC 


889 


GCTCAGGT A TGGTGGGG 


2134 




UAUUCUUU UUOAUUAO A COAA AUCCACUU 


890- 


AAGTGG AT T CAGGAA'I C 


2135 


4929 


AGAUUCCU CUGAUGAG X CGAA AAUCCACU 


891 


AGTGGATT C AGGAATCT 


2136 


4936 


AUUCCCCA CUGAUGAG X CGAA AUUCCUGA 


892 


TCAGGAAT C TGGGGAAT 


2137 


4952 


UCUUAAUA CUGAUGAG X CGAA AUUUGCCA 


893 


TGGCAAAT A TATTAAGA 


2138 


4954 


CUUCUUAA CUGAUGAG X CGAA AUAUUUGC 


894 


GCAAATAT A TfAAGAAG 


2139 


4956 


CUCUUCUU CUGAUGAG X CCiAA AUAUAUUU 


895 


AAAT ATAT T AAG AAG AG 


2140 
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5H7 


UUACUGCU CUGAUGAG X CGAA AUGUUGUU 


927 


AACAACAT C AGCAGTAA 


2172 


5124 


AUGGACUU CUGAUGAG X CGAA ACUGCUGA 


928 


TCAGCAGT A AAGTCCAT 


2173 


5129 


AUUCCAUG CUGAUGAG X CGAA ACUUUACU 


929 


AGTAAAGT C CATGGAAT 


2174 


5138 


CCACUAGC CUGAUGAG X CGAA AUUCCAUG 


930 


CATGGAAT A GCTAGTGG 


2175 


5142 


CAGACCAC CUGAUGAG X CGAA AGCUAUUC 


93! 


GAATAGCT A GTGGTCTG 


2176 


5148 


GAAACACA CUGAUGAG X CGAA ACCACUAG 


932 


CI AGTGGT C TGTGTTTC 


2177 


5154 


CGAAAAGA CUGAUGAG X CGAA ACACAGAC 


933 


GTCTGTGT T TCTTTTCG 


2178 


5155 


GCGAAAAG CUGAUGAG X CGAA AACACAGA 


934 ' 


TCTGTGTT T CTTTTCGC 


2179 


5156 


GGCGAAAA CUGAUGAG X CGAA AAACACAG 


935 


CTGTGTTT C TTTTCGCC 


2180 


5158 


AUGGCGAA CUGAUGAG X CGAA AGAAACAC 


936 


GTGTTTCT T TTCGCCAT 


2181 


5159 


AAUGGCGA CUGAUGAG X CGAA AAGAAACA 


937 


TGTTTCTT T TCGCCATT 


2182 


5160 


CAAUGGCG CUGAUGAG X CGAA AAAGAAAC 


938 


G'I"rrCTTT T CGCCATTG 


2183 


5161 


GCAAUGGC CUGAUGAG X CGAA AAAAGAAA 


939 


TTTCTTTT C GCCATTGC 


2184 


5167 


AGCUAGGC CUGAUGAG X CGAA AUGGCGAA 


940 


TTCGCCAT T GCCTAGCT 


218S 


5172 


CGGCAAGC CUGAUGAG X CGAA AGGCAAUG 


941 


CATTGCCT A GCTTGCCG 


2186 


5176 


AUUACGGC CUGAUGAG X CGAA AGCUAGGC 


942 


GCCTAGCT T GCCGTAAT 


2187 


5182 


AGAAUCAU CUGAUGAG X CGAA ACGGCAAG 


943 


CTTGCCGT A ATGATTCT 


2188 ; 


5188 


CAUUAUAG CUGAUGAG X CGAA AUCAUUAC 


944 


GTAATGAT T CTATAATG 


2189 . 


5189 


GCAUUAUA CUGAUGAG X CGAA AAUCAUUA 


945 


TAATGATT C TATAATGC 


2190,' 


5191 


UGGCAUUA CUGAUGAG X CGAA AGAAUCAU 


946 


ATGATTCT A TAATGCCA 


2191 


5193 


GAUGGCAU CUGAUGAG X CGAA AUAGAAUC 


947 


GATTCTAT.A ATGCCATC 


2192 


5201 


UGCUGCAU CUGAUGAG X CGAA AUGGCAUU 


948 


AATGCCAT C ATGCAGCA 


2193 


5212 


CCUCUCAU CUGAUGAG X CGAA AUUGCUGC 


949 


GCAGCAAT T ATGAGAGG 


2194 


5213 


GCCUCUCA CUGAUGAG X CGAA AAUUGCUG 


950 


CAGCAATT A TGAGAGGC 


2195 


5223 


GGAUGACC CUGAUGAG X CGAA AGCCUCIJC 


951 


GAGAGGCT A GGTCATCC 


2196 


5227 


CUUUGGAU CUGAUGAG X CGAA ACCUAGCC 


952 


GGCTAGGT C ATCCAAAG 


2197 


5230 


UCUCUUUG CUGAUGAG X CGAA AUGACCUA 


953" 


TAGGTCAT C CAAAGAGA 


2198 


5246 


UACAUUGA CUGAUGAG X CGAA AGGGUCUU 


954 


AAG ACCC 1 A 1 CAATGTA 


2199 


5248 


CCUACAUU CUGAUGAG X CGAA AUAGGGUC 


955 


GACCCTAT C AATGTAGG 


2200 


5254 


UUGCAACC CUGAUGAG X CGAA ACAUUGAU 


956 


ATCAATGT A GGTTGCAA 


2201 


5258 


GAUUUUGC CUGAUGAG X CGAA ACCUACAU 


957 


ATGTAGGT T GCAAA ATC 


2202 
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5266 



5275 
5288 



AGGGOUUA CUGAUGAG X CGAA AUUUUGCA I 958 



UUAUGGGU CUUAUGAG X CGAA AGAUUUUG 



UACUUCCU CUGAUGAG X CGAA AGGGGUUA 



AAAUCAAA CUGAUGAG X CGAA ACUGCACU 



UCAAAUCA CUUAUGAG X CGAA AGACUGCA 



AUCa/vauu CUUAUGAG X CGAA AAGACUGC 



959 



TGCAAAAT C TAACCCCT" 



caaaatct a ACCCCTAA" 



960 



961 



962 



GGAAAUCA CUGAUGAG X CGAA AUCAAAGA 



963 



2203 



2204 



I AACCCCT A AGGAAGTG 



AGTGCAGT C TTTGATTT 



TGLAUlCrTTGATTTGA 



964 



GCAGTCTTTGAJ"1"IGAT 



TCrilGArTTGATTTCC" 



2205 



2206 



2207 



2208 



2209 



5296 
5300 



GGGAaauu UUUAUGAG X CGAA AAUCAAAG 
ACUAGGGA CUGAUGAG X CGAA AUCAAAUC 



965 

'966' 



CH-rGAITTGATTTCCC 
UAITTGAT T TCCCTAGT 



2210 
l2ir 



5302 



UACUAGGG CUGAUGAG X CGAA AAUCAAAU 
UUACUAGG CUGAUGAG X CGAA AAAUCAAA 



967 
"96r 



Al nUATT T CCCTAGTA" 
lllGAJTrCCCTAGTAA 



AAUGUUAC CUGAUGAG X CGAA AGGGAAAU 



969 



AiriCCCTAGTAACaT 



5309 



UUCAAGGU CUGAUGAG X CGAA ACUAGGGA" 



970 



1 CCCTAGT A ACCTTGCA 



AUAUCUGC CUGAUGAG X CGAA AGGUUACU 



971 



AGTAACCT T GCAGATAT 



5321 



GUUAAACA CUGAUGAG X CGAA AUCUGCAA 



5325 CUUGGUUA CUGAUGAG X CGAA ACAUAUCU 



972 



TTGCAGAT A TGTTTAAC 



973 



AGA lATGT T TAACCAAG 



2212 



2214 



2215 



2216 



2217 



2218 



GCUUGGUU CUGAUGAG X CGAA AACAUAUC 



5327 



UUCUUGGU CUGAUGAG X CGAA AAACAUAU 



5338 
5349 



GCauouuu CUGAUGAG X CGAA AUGGCUUG 
UCCCUCAA CUGAUGAG X CGAA AGGCAUGG 



5350 



AUCCCUCA CUGAUGAG X CGAA AAGGCAUG 



5351 



CAGCCCUC CUGAUGAG X CGAA AAAGGCAU 



974 



GATATGTT T AACCAAGC 



975 



ATAI G rn A ACCAAGCC 



976 
"97r 



caagccat a gcccatgc 

CCATGCCT T TTGAGGGC 



978 



CATGCCIT T TG AGGGCT 



979 



ATGCCTTT T GAGGGCTG 



2219 



2220 



2221 
■222r 

"22^" 



2224 



AAGUCCCU CUGAUGAG X CGAA AUUUGUUC 



980 



GAACAAAT A AGGG ACTT 



UUAUCAGU CUUAUGAG X CGAA AGUCCCUU 



981 



AAGGOACr T ACTGATAA 



AUUAUCAG CUGAUGAG X CGAA AAGUCCCU 



982 



AGGGACTT A CTG ATAAT 



AAGUAAAU CUGAUGAG X CGAA AUCAGU AA 



983 



riACTGATAATTI'ACTr 



5385 



CAAAAGUA CUGAUGAG X CGAA AUUAUCAG 



984 



CTGA 1 AAT T TACTTTTG" 



2225 
"222r 



2227 



2228 



2229 

2230 



UCAAAAGU CUGAUGAG X CGAA AAUUAUCA" 



985 



5387 
5390 



AUC/VAAAU CUGAUGAG X CGAA AAAUUAUC 



986 



UUGAUCAA CUGAUGAG X CGAA AGUAAAUU 



987 



UGUOAUCA CUGAUGAG X CGAA AAGUAAAU 



988 



TGATAATT 1 ACTTTTGA 



GAlAAriTACTTrrGAT" 



AATTTACl 1 'nGATCAC" 



AniACi'lTTGATCACA" 



2231 
"2232" 



2233 
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5392 


AUGUGAUC CUGAUGAG X CGAA AAAGUAAA 


989 


rriAcrri'TGATCACAT 


2234 


5396 


CUUAAUGU CUGAUGAG X CGAA AUCAAAAG 


990 ii 


crrriGAT c acattaag 


2235 


5401 


AACACCUU CUGAUGAG X CGAA AUGUGAUC 


991 


GATCACAT T AAGGTGTT 


2236 


5402 


GAACACCU CUGAUGAG X CGAA AAUGUGAU 


992 


ATCACATT A AGGTGTTC 


2237 


5409 


AAGGUGAG CUGAUGAG X CGAA ACACCUUA 


993 


TAAGGTGT T CTCACCTT 


2238 


5410 


CAAGGUG A CUGAUGAG X CGAA AACACCUU 


994 


AAGGTGIT C TCACCTTG 


2239 


5412 


UUCAAGGU CUGAUGAG X CGAA AG AACACC 


995 


GGTGTTCT C ACCTTGAA 


2240 


5417 


AAGAUUUC CUGAUGAG X CGAA AGGUGAGA 


996 


TCTCACCT T GAAATCTT 


2241 


5423 


GUGUAUAA CUGAUGAG X CGAA AUUUCAAG 


997 


CTTGAAAT C TTATACAC 


2242 


5425 


CAGUGUAU CUGAUGAG X CGAA AGAUUUCA 


998 


TGAAATCT T ATACACTG 


2243 


5426 


UGAGUGUA CUGAUGAG X CGAA AAGAUUUC 


999 


GAAATCTT A TACACTGA 


2244 


5428 


UUUCAGUG CUGAUGAG X CGAA AUAAGAUU 


1000 


AATCTTAT A CACTGAAA 


2245 


5444 


CCUAAAUC CUGAUGAG X CGAA AUGGCCAU 


1001 


ATGGCCAT T GATTTAGG 


2246 


5448 


GUGGCCUA CUGAUGAG X CGAA AUCAAUGG 


1002 


CCATTGAT T TAGGCCAC 


2247 


5449 


AGUGGCCU CUGAUGAG X CGAA AAUCAAUG 


1003 


CATTGATT T AGGCCACT 


2248 


5450 


CAGUGGCC CUGAUGAG X CGAA AAAUCAAU 


1004 


ATTGA TTT A GGCCACTG 


2249 


5462 


AGUACUCU CUGAUGAG X CGAA AGCCAGUG 


1005 


CACTGGCT T AGAGTACT 


2250 


5463 


GAGUACUC CUGAUGAG X CGAA AAGCCAGU 


1006 


ACTGGCn A GAGTACTC 


2251 


5468 


GGAAGGAG CUGAUGAG X CGAA ACUCUAAG 


1007 


CTTAGAGT A CTCCTTCC 


2252 


5471 


AGGGGAAG CUGAUGAG X CGAA AGUACUCU 


1008 


AGAGTACT C CTTCCCCT 


2253 


5474 


UGCAGGGG CUGAUGAG X CGAA AGGAGUAC 


1009 


GTACTCCT T CCCCTGCA 


2254 


5475 


AUGCAGGG CUGAUGAG X CGAA AAGGAGUA 


1010 


TACTCCTT C CCCTGCAT 


2255 


5493 


GUAUUUGU CUGAUGAG X CGAA AUCAGUGU 


1011 


ACACTGAT T ACAAATAC 


2256 


5494 


AGUAUUUG CUGAUGAG X CGAA AAUCAGUG 


1012 


CACTGATT A CAAATACT 


2257 


5500 


UAGGAAAG CUGAUGAG X CGAA AUUUGUAA 


1013 


TTACAAAT A CTTTCCTA 


2258 




uAAUAuGA CUGAUGAG X CGAA AGUAUUUG 


1014 


CAAATACT T TCCTATTC 


2259 


5504 


UGAAUAGG CUGAUGAG X CGAA AAGUAUUU 


1015 . 


AAATACTT T CCTATTCA 


2260 


5505 


AUGAAUAG CUGAUGAG X CGAA AAAGUAUU 


1016 


AATACTTT C CTATTCA F 


2261 


5508 


AGUAUGAA CUGAUGAG X CGAA AGGAAAGU 


1017 


ACITTCCT A 1TCATACT 


2262 


5510 


AAAGUAUG CUGAUGAG X CGAA AUAGGAAA 


1018 


TTTCCl Al T CATAC1TT 


2263 


5511 


GAAAGUAU CUGAUGAG X CGAA AAUAGGA A 


1019 


TTCCTATT C ATACTTTC 


2264 
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15668 


CUUGCACC CUGAUGAG X CGAA AAGAGAAU 


1051 


ATTCTCTT T GGTGCAAG 


2296 


5678 


GCUCCCAA CUGAUGAG X CGAA ACUUGCAC 


1052 


GTGCAAGT C TTGGGAGC 


2297 


5680 


ACGCUCCC CUGAUGAG X CGAA AGACUUGC 


1053: 


GCAAGTCT T GGGAGCGT 


2298 


5692 


GUAAUCUA CUGAUGAG X CGAA AUCACGCU 


1054' 


AGCGTGAT C TAGATTAC 


2299 


5694 


GUGUAAUC CUGAUGAG X CGAA AGAUCACG 


1055' 


CGTGATCT A GATTACAC 


2300 


5698 


UGCAGUGU CUGAUGAG X CGAA AUCUAGAU 


1056 


ATCTAGAT T ACACTGCA 


2301 


15699 


GUGCAGUG CUGAUGAG X CGAA AAUCUAGA 


1057' 


TCTAGATT A CACTGCAC 


2302 


15711 


AACUUGGG CUGAUGAG X CGAA AUGGUGCA 


1058. 


TGCACCAT T CCCAAGTT 


2303 


55712 


UAACUUGG CUGAUGAG X CGAA AAUGGUGC 


1059 


GCACCATT C CCAAGTTA 


2304 


5719 


AGGGGAUU CUGAUGAG X CGAA ACUUGGGA 


1060 


TCCCAAGl T AATCCCCT 


2305 


5720 


CAGGGGAU CUGAUGAG X CGAA AACUUGGG 


1061 


CCCAAGTT A ATCCCCTG 


2306 


5723 


UUUCAGGG CUGAUGAG X CGAA AUUAACUU 


1062 


AAGTTAAT C CCCTGAAA 


2307 


5735 


UUGAGAGU CUGAUGAG X CGAA AGUUUUCA 


1063 


TGAAAACT T ACTCTCAA 


2308 


5736 


GUUGAGAG CUGAUGAG X CGAA AAGUUUUC 


1064 


GAAAACTT A CTCTCAAC 


2309 


5739 


CCAGUUGA CUGAUGAG X CGAA AGUAAGUU 


1065 


AACTTACT C TCAACTGG 


2310 


5741 


CUCCAGUU CUGAUGAG X CGAA AGAGUAAG 


1066 


CTTACTCT C AACTGGACj 


2311 


5760 


UGGGACCA CUGAUGAG X CGAA AGUUCAUU 


1067 


AATGAACT T TGGTCCCA 


2312 


5761 


UUGGGACC CUGAUGAG X CGAA AAGUUCAU 


1068 


ATGAACTT T GGTCCCAA 


2313 


5765 


AUAUUUGG CUGAUGAG X CGAA ACCAAAGU 


1069 


ACTTTGGT C CCAAATAT 


2314 


5772 


AAGAUGGA CUGAUGAG X CGAA AUUUGGGA 


1070 


TCCCAAAT A TCCATCTT 


2315 


5774 


AAAAGAUG CUGAUGAG X CGAA AUAUUUGG 


1071 


CCAAATAT C CATCTTTT 


2316 


5778 


ACUGAAAA CUGAUGAG X CGAA AUGGAUAU 


1072 


ATATCCAl C TTTTCAGT 


2317 


5780 


CUACUGAA CUGAUGAG X CGAA AGAUGGAU 


1073 


ATCCATCT T TTCAGTAG 


2318 


5781 


GCUACUGA CUGAUGAG X CGAA AAGAUGGA 


1074 


TCCATCTT T TCAGTAGC 


2319 


5782 


CGCUACUG CUGAUGAG X CGAA AAAGAUGG 


1075 


CCATCTTT T CAGTAGCG 


2320 


J / OJ 




1076 ' 


CATCTTTT C AGTAGCGT 


2321 


5787 


AUUAACGC CUGAUGAG X CGAA ACUGAAAA 


1077 


TTTTCAGT A GCGTTAAT 


2322 


5792 


GCAUAAUU CUGAUGAG X CGAA ACGCUACU 


1078 


AGTAGCGT T AATTATGC 


2323 


5793 


AGCAUAAU CUGAUGAG X CGAA AACGCUAC 


1079 


GTAGCGTT A ATTATGCT 


2324 


5796 


CAGAGCAU CUGAUGAG X CGAA AUUAACGC 


1080 


GCGTTAAT T ATGCTCTG 


2325 


5797 


ACAGAGCA CUGAUGAG X CGAA AAUUAACG 


I08I 


CGriAAIT A TGCTCTGT 


2326 
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5»02j UGGAAACACUGAaCAGXCGAAAGCAUAAU 



1082 



5808 
5818 



5819 

5820 



UAUUUGGA CU GAUGAG X CGAA ACAGAOCA J083 
UUAUUUGG CUGAUGAG X CGAA AACAGAGC 1084 



UUCAUUUG CUGAUGAG X CGAA AAACAGAG 1085 



OUAAAGGA CUGAUGAG X CGAA AUGCAGUU 1086 
UUUAAAGG CUGAUGAG X CGAA AAUGCAGU 1087 " 



UUUUAAAG CUGAUGAG X CGA A AAAUGCAG 



1088 



58;^ j CAAUUGGA CUGAUGAG X COAA AGGAAAUG | 1089 
5824 



5825 
5830 



UCAAUUGG CUGAUGAG X CGAA AAGGAAAU | 1090 " 
UUCAAUUG CUGAUGAG X CGAA AAAGGAAA" 



ATTAJGC! C TGTTTCCA. 

iuctctgtttccaactg" 

OtlClbil I CCAACTGC 



CiUiU'lTrCCAAClGCA 2330" 



AAUlUCATT JCCincC 2331 



ACKjcArrxcci j i ccA 2332" 



Cl UCA ITT CCincCAA 2333 



1091 



CAlllLCl i rCCAATTG j 2334 
A 1 1 1 CCTT T CCAATTGA | 2335^ 



riKJUiTrcCAAllUAA 2336" 



5835 



UUUAAUUC CUGAUGAG X CGAA AUUGGAAA 
UAUACUUU CUGAUGAG X CGAA AUUCAAUU 



1092 
l09r 



TTICCAATTGAATTAAA 
AAH UAAT T AAAGTGTG 



1094 



(JUAAAAAC CUGAUGAG X CGAA AGGCCACA 



AlrGAATTAAAUlGTGG 2339" 



585 



UUACUAAA CUGAUGAG X CGAA ACGAGGCC 



1095 
T09r 



IGlGGCCTCGlrrfTAG"! 
uucCTCGTTri-rAGTCA | 234r 



J097 



GcuiUjrm lAGlCAT 2342 



5853 
5854 



1098 



tCiCGin TTAGTCATT 



AAAUGACU CUGAUGAG X CGAA AAAACGAG" 



1099 



CJCGM IT l AGTCATTT 



U AAAUGAC CUGAUGAG X CGAA AAAAACGA 



I too 



TCG urn A GTCATTTA 



5858 



UUUUAAAU CUGAUGAG X CGAA ACUAAAAA" 



UOl 



TnilAGl CATTTAAAA 



CAAUUUUA CUGAUGAG X CGAA AUGACUAA 



1102 



ACAAUUUU CUGAUGAG X CGAA AAUGACUA 
AACAAUUU CUGAUGAG X CGAA AAAUGACU 



5868 
5871 



UauaAAAC CUGAUGAG X CGAA AUUUUAAA 



1103 

TToT 
TToT 



ACUuAUAA CUGAUGAG X CGAA ACAAUUUU 



1106 



5873 
5874 
5876 



UACUUAGA CUGAUGAG X CGAA AACAAUUU | 1 107 " 
UUACUUAG CUGAUGAG X CGAA AAACAAUU I 1108 
AUUACUUA CUGAUGAG X CGAA AAAACAAU TTOT 
CAAUUACU CUGAUGAG X CGAA AGAAAACA [" I \W 



5880 



UCAGCAAU CUGAUGAG X CGAA ACUUAGAA 



Mil 



5883 



GAUUCAGC CUGAUGAG X CGAA AUUACUUA 



TTAGlCAl TTAAAATTG 



TAGlCAm AAAATTGf 
AUICA HT A AAArrOTf 



TTIAAAATTGITITCTA 



AAAAliGTTTTCTAAGT 



AAA! 1 G 1 1 T TCTAAGTA" 
AAJlCim'TCTAAGTAA 



AilGiTITCTAAGTAAT 
■lOllllCl AAGTAATTG 



TTC 1 AAGi A ATTGCTGC" 



M12 



TAAGTAATTGCTGCCTC" 



2343 



2344 



2345 



2346 



2350 



2351 



2352 



2353 

i35r 
i35r 



2356 



2357 
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5891 


CCAUAAUA CUGAUGAG X CGAA AGGCAGCA 


1113 


TGCTGCCT C TATTATGG 


2358 II 


5893 


UGCCAUAA CUGAUGAG X CGAA AGAGGCAG 


1114- 


CTGCCTCT A TTATGGCA 


2359 1 


5895 


AGUGCCAU CUGAUGAG X CGAA AUAGAGGC 


1115 


GCCTCTAT T ATGGCACT 


2360 


5896 


AAGUGCCA CUGAUGAG X CGAA AAUAGAGG 


III6 


CCTCTATT A TGGCACTT 


2361 


5904 


CAAAAUUG CUGAUGAG X CGAA AGUGCCAU 


1117 


ATGGCACT T CAATTTTG 


2362 


j5905 


GCAAAAUU CUGAUGAG X CGAA AAGUGCCA 


1118 


TGGCACIT C AATTTTGC 


2363 


|5909 


CAGUGCAA CUGAUGAG X CGAA AUUGAAGU 

• 


1119 


ACTTCAAT T TTGCACTG 


2364 


5910 


ACAGUGCA CUGAUGAG X CGAA AAUUGAAG 


1120 


CTTCAATT T TGCACTGT 


2365 


5911 


GACAGUGC CUGAUGAG X CGAA AAAUUGAA 


1121 


TTCAATTT T GCACTGTC 


2366 


5919 


UCUCAAAA CUGAUGAG X CGAA ACAGUGCA 


1122 


TGCACTGT C TTTTGAGA 


2367 


5921 


AAUCUCAA CUGAUGAG X CGAA AGACAGUG 


1123 


CACTGTCT T TTGAGATT 


2368 


5922 


GAAUCUCA CUGAUGAG X CGAA AAGACAGU 


1124 


ACTGTCIT T TGAGATTC 


2369 


5923 


UGAAUCUC CUGAUGAG X CGAA AAAGACAG 


1125 


CTGTCTTT T GAGATTCA 


2370 


5929 


UUUUCUUG CUGAUGAG X CGAA AUCUCAAA 


1126 


TTTGAG AT T CAAG AAAA 


2371 


5930 


UUUUUCUU CUGAUGAG X CGAA AAUCUCAA 


1127 


TTGAGATT C AAG AAAAA 


2372 


5940 


UGAAUAGA CUGAUGAG X CGAA AUUUUUCU 


1128 


AG AAAAAT T TCTATTCA 


2373 


5941 


AUGAAUAG CUGAUGAG X CGAA AAUUUUUC 


1129 


GAAAAATT T CTATTCAT 


2374 


5942 


AAUGAAUA CUGAUGAG X CGAA AAAUUUUU 


1130 


AAAAATTT C TATTCATT 


2375 


5944 


AAAAUGAA CUGAUGAG X CGAA AGAAAUUU 


1131 


AAATTTCT A TTC ATTTT 


2376 


5946 


AAAAAAUG CUGAUGAG X CGAA AUAGAAAU 


1132 


AriTCTAT r CAri rrn 


2377 


5947 


AAAAAAAU CUGAUGAG X CGAA AAUAGAAA 


1133 


TTTCTAIT C ATTlTrTT 


2378 


5950 


UGCAAAAA CUGAUGAG X CGAA AUGAAUAG 


1134. 


CTATTCAT T TTTTTGCA 


2379 


5951 


AUGCAAAA CUGAUGAG X CGAA AAUGAAUA 


I135> 


TATTCATT T TTTTGCAT 


2380 


5952 


GAUGCAAA CUGAUGAG X CGAA AAAUG AAU 


1136 


ATTCATTT T TTTGCATC 


2381 


5953 


GGAUGCAA CUGAUGAG X CGAA AAAAUGAA 


1137 


TTC ATTTT T TTGCATCC 


2382 




UOUAUUCA CUUAUijAu A CuAA AAAAAUUA 


1 138 


TCATTTTT T TGCATCCA 


2383 


5955 


UUGGAUGC CUGAUGAG X CGAA AAAAAAUG 


1139 


CATTTTTT T GCATCCAA 


2384 


5960 


CACAAUUG CUGAUGAG X CGAA AUGCAAAA 


1140 


TTTTGCAT C CAAITGTG 


2385 


5965 


UCAGGCAC CUGAUGAG X CGAA AUUGGAUG 


1141 


CATCCAAT T GTGCCTGA 


2386 


5977 


UAUUUUAA CUGAUGAG X CGAA AGUUCAGG 


1142 


CCTG AACT T TTAAAATA 


2387 


5978 


AUAUUUUA CUGAUGAG X CGAA AAGUUCAG 


1143 


CTGAACTT T TAAAATAT 


2388 
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397 


9 CAUAUUUU CUGAUGAG X CGAA AAAGUUCA 


1144 


TGAACm- T AAAATATG 


2389 


598 


3 ACAUAUUU CUGAUGAG X CGAA AAAAGUUC 


1145 


GAACi ill A AAATATGT 


2390 


598] 


5 CAUUUACA CUGAUGAG X CGAA AUUUUAAA 


1146 


riTAAAAT A TGTAAATG 


2391 


5985 


} GCAGCAUU CUGAUGAG X CGAA ACAUAUUU 


1147 


AAATATGT A AATGCTGC 


2392 


6003 


; GGGUUUGG CUGAUGAG X CGAA ACAUGGCA 


1148 


TGCCATGT T CCAAACCC 


2393 


6004 


UGGGUUUG CUGAUGAG X CGAA AACAUGGC 


1149 


GCCATGTT C CAAACCCA 


2394 


60M 


ACACUGAC CUGAUGAG X CGAA AUGGGUUU 


1150 


AAACCCAT C GTCAGTGT 


2395 1 


6017 


CACACACU CUGAUGAG X CGAA ACGAUGGG 


1151 


CCCATCGT C AGTGTGTG 


2396 


6029 


CAGCUCUA CUGAUGAG X CGAA ACACACAC 


1152 


GTGTGTGT T TAGAGCTG 


2397 


6030 


ACAGCUCU CUGAUGAG X CGAA AACACACA 


1 153 


TGTGTGTT T AGAGCTGT 


2398 


6031 


CACAGCUC CUGAUGAG X CGAA AAACACAC 


1154 


GTGTGTTT A GAGCTGTG 


2399 j 


6046 


GUUGUUUC CUGAUGAG X CGAA AGGGUGCA" 


1155 


TGCACCCT A GAAACAAC 


2400 


6057 


GGGACAAG CUGAUGAG X CGAA AUGUUGUU 


1156 


AACAACAT A CTTGTCCC 


2401 


6060 


CAUGGGAC CUGAUGAG X CGAA AGUAUGUU 


1157 


AACATACT T GTCCC ATG 


2402 


6063 


GCUCAUGG CUGAUGAG X CGAA ACAAGUAU 


1158 


ATACTTGT C CCATGAGC 


2403~ 


6095 


UGAAUGCA CUGAUGAG X CGAA AGGGGUCU 


1159 


AGACCCCT T TGCATTCA 


2404 


6096 


GUGAAUGC CUGAUGAG X CGAA AAGGGGUC 


1160 


GACCCCTT T GCATTCAC 


2405 


6101 


UCUCUGUG CUGAUGAG X CGAA AUGCAAAG 


1161 


CI ri'GCAT T CACAGAGA 


2406 


6102 


CUCUCUGU CUGAUGAG X CGAA AAUGCAAA " 


1162 


rn'GCATT C ACAGAGAG 


2407 


6113 


UAACCAAU CUGAUGAG X CGAA ACCUCUCU 


1163 


AGAGAGGT C ATTGGTTA 


2408 


6116 


CUAUAACC CUGAUGAG X CGAA AUGACCUC 


1164 


GAGGTCAT T GGTTATAG 


2409 


6120 


GUCUCUAU CUGAUGAG X CGAA ACCAAUGA 


1165 


TCATTGGT T ATAGAGAC 


2410 


6121 


AGUCUCUA CUGAUGAG X CGAA AACCAAUG 


1166 


CATTGGTT A TAGAGACT 


2411 


6123 


CAAGUCUC CUGAUGAG X CGAA AUAACCAA "" 


1167 


TTGGTTAT A GAGACTTG 


2412 


6130 


AUUAAUUC CUGAUGAG X CGAA AGUCUCUA 


1168 


TAGAGACT T GAAITAAT 


2413 


6135 


CACUUAUU CUGAUGAG X CGAA AUUCAAGU 




ACl lUAAl I AATAAGTG 


2414 


6136 


UCACUUAU CUGAUGAG X CGAA AAUUCAAG ' 


1170 


CriGAATT A ATAAGTGA 


2415 


6139 


AUGUCACU CUGAUGAG X CGAA AUUAAUUC 


1171 


UAATTAAT A AGTGAC AT 


2416 


6148 


ACUGGCAU CUGAUGAG X CGAA AUGUCACU " " 


1172 


AGTGACAT T ATGCCAGT 


2417 


6149 


AACUGGCA CUGAUGAG X CGAA AAUGUCAC" " 


1173 


GTGACATT A TGCCAGrr 


2418 


6157 


AGAACAGA CUGAUGAG X CGAA ACUGGCAU 


1174 


ATGCCAG J r TCTGTTCT 


2419 
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£. 1 CO 

0158 


GAGAACAG CUGAUGAG X CGAA AACUGGCA 


1175 


TGCCAGTT T CTGTTCTC 


2420 1 


6159 


AGAGAACA CUGAUGAG X CGAA AAACUGGC 


1176 


GCCAGTTr C TGTTCTCT 


2421 


6163 


UGUGAGAG CUGAUGAG X CGAA ACAGAAAC 


1177 


G rrrcTGT t ctctcaca 


2422 


6164 


CUGUGAGA CUGAUGAG X CGAA AACAGAAA 


1 178' 


rn C 1 GTT C TCTCACAG 


2423 


6166 


ACCUGUGA CUGAUGAG X CGAA AGAACAGA 


1179 


TCTGTTCT C TCACAGGT 


2424 


6168 


UCACCUGU CUGAUGAG X CGAA AGAGAACA 


1180 


TGTTCTCT C ACAGGTGA 


2425 


6178 


GCAUUGUU CUGAUGAG X CGAA AUCACCUG 


1181 


CAGGTGAT A AACAATGC 


2426 


6188 


UGCACAAA CUGAUGAG X CGAA AGCAUUGU 


1182 


ACAATGCT T TTTGTGCA 


2427 


6189 


GUGCACAA CUGAUGAG X CGAA AAGCAUUG 


1183 


CAATGCTT T TTGTGCAC 


2428 1 


6190 


AGUGCACA CUGAUGAG X CGAA AAAGCAUU 


1184 


AATGCTTT T TGTGCACT 


2429 1 


6191 


UAGUGCAC CUGAUGAG X CGAA AAAAGCAU 


1185 


ATGcrnr r gtgcacta 


2430 1 


6199 


AGAGUAUG CUGAUGAG X CGAA AGUGCACA 


1186 


TGTGCACT A CATACTCT 


2431 


6203 


CUGAAGAG CUGAUGAG X CGAA AUGUAGUG 


1187 


CACTACAT A CTCTTCAG 


2432 


6206 


ACACUGAA CUGAUGAG X CGAA AGUAUGUA 


1188 


TACATACT C TTCAGTGT 


2433 


6208 


CUACACUG CUGAUGAG X CGAA AGAGUAUG 


1189 


CATACTCT T CAGTGTAG 


2434 


6209 


UCUACACU CUGAUGAG X CGAA AAGAGUAU 


1190 


ATACTCTT C AGTGTAGA 


2435. 


«,5 


AAGAGCUC CUGAUGAG X CGAA ACACUGAA 


1191 


TTCAGTGT A GAGCTCTT 


2436 


6221 


UAAAACAA CUGAUGAG X CGAA AGCUCUAC 


1192 


GTAGAGCT C TTGTTTl A 


2437 


6223 


CAUAAAAC CUGAUGAG X CGAA AGAGCUCU 


1193 


AGAGCTCT T GITTTATG 


2438 


6226 


UCCCAUAA CUGAUGAG X CGAA ACAAGAGC 


1194 


GCTCTTGT T TTATGGGA 


2439'* 


6227 


UUCCCAUA CUGAUGAG X CGAA AACAAGAG 


1195 


CTCTTGTT T TATGGGAA 


2440 


6228 


UUUCCCAU CUGAUGAG X CGAA AAACAAGA 


1196 


TCTTGTTT T ATGGGAAA 


2441 


6229 


UUUUCCCA CUGAUGAG X CGAA AAAACAAG 


1197 


CTTGTTTT A TGGGAAAA 


2442 


6242 


UGGCAUUU CUGAUGAG X CGAA AGCCUUUU 


1198 


AAAAGGCT C AAATGCCA 


2443 


6254 


UCAAACAC CUGAUGAG X CGAA AUUUGGCA 


1199 


TGCCAAAT T GTGITTGA 


2444 


6259 


AUCCAUCA CUGAUGAG X CGAA ACACAAUU 


1200 


AATTOTf^T T THAT fid AT 


Oil A C 


6260 


AAUCCAUC CUGAUGAG X CGAA AACACAAU 


1201- 


ATTGTGTT T GATGGATT 


2446 


6268 


GGCAUAUU CUGAUGAG X CGAA AUCCAUCA 


1202' 


TGATGGAT T AATATGCC 


- 2447 


6269 


GGGCAUAU CUGAUGAG X CGAA AAUCCAUC 


1203 


GATGGATT A ATATGCCC 


2448 


6272 


AAAGGGCA CUGAUGAG X CGAA AUUAAUCC 


1204 


GGATTAAT A TGCCCn^I 


2449 


6279 


AUCGGCAA CUGAUGAG X CGAA AGGGCAUA 


1205 


TAIGCCC r T TTGCCGAT 


2450 
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CAUCGGCA CUGAUGAG X CGAA AAGGGCAU 
w»i ^^AUCGGCCUGAUGAGXCGAAAAAGGGCA 
\t>^yA AUUAAUAGCUGAUGAGXCGAAAUGCAUCG 



t>z^:>\ AUCAGUAA CUGAUGAG X CGAA AGUAUGCA 
WW ACAUCAGU CUGAUGAG X CGAA AUAGUAUG 
^ACAUCAO CUGAUGAG X CGAA AAUAGUAU 



ACAAAACC CUGAUGAG X CGAA AGUCACAU 



IbJ 14 UGCGACA A CUGAUGAG X CGAA ACCGAGUC " 
CUGCGAC A CUGAUGAG X CGAA AACCGAGU 



0JI6 GCUGCGAC CUGAUGAG X CGAA AAACCGAG- 



W ly AAAGCUGC CUGAUGAG X CGAA ACAAAACC 
^si(y\ ACAAAGCA CUGAUGAG X CGAA AGCUGCGA 



1206 
1207 



1208 



1209 



1212 



1213 
TJlT 



AJUCCC7TTTGCCGATG 
iOCCCilTTGCCGATGC 



CGATGCAT A CTATTACT 



lUCATACTA'ITACTGAT 
CAi ALTAI TACTGATGT 



AiALlA'lTACTGATGTG 



2451 
1452 
'245T 



2454 
2455 



AiUJGACTCGGTTTTGt 



■oAClCGGrTTTGTCGCA 
ACICGGTTTTGTCGCAG 



AACAAAUL CUGAUGAG X CGAA AAGCUGCG 

WJl| AUUAAACA CUGAUGAG X CGAA AGCAAAGC 

6332 



1215 
1216* 
T2T7 
T2T8 



1219 



LlCUGTTTTGTCGCAGC 
uuili TGTCGCAGCnf 

icGCAGcn JGci rrGT 

UGCAGCTT T GCTTTGTT 



uuiJiCiCTTTGTTTAAT 



2456 



2457 



2458 



2459 
2460 
^46r 
l46r 



2463 
■246r 



CUGAUGAG X CGAAAAGCAAAG 
633:>| UUUCAUU A CUGAUGAG X CGAA ACAAAGCA 
DJJO CiUUUCAUU CUGAUGAG X CGAA AACAAAGC 



1220 
T22r 



CniGCJJ I Gi'ITAATG 
J UCriTGT T TAATGAAA" 



2465 



2466 



1222 



o.. / UUUUUCA U CUGAUGAG X CGAA AAACAAAG 
AUGUUUA C CUGAUGAG X CGAA AGUGUGUU 



GCTTTGTT T AATG AAAC 



2467 



1223 



CMlGiU AATGAAACA 



2468 



oj^A AAGAGGUU CUGAUGAG X CGAA ACAAGUGU 



1224 



AACACACT T GTAAACCT 



2469 



635V GUGCAAAA CUGAUGAG X CGAA AGGUUUAC 



1225 
T226" 



ACACH GI AAACCTCTt 



63d, AAGUGCAA CUGAUGAG X CGAA AGAGGUUU 



GTAAACCT C TTTTGCAC 



6302 AAAGUGCA CUGAUGAG X CGAA AAGAGGUU 

o:»oj GAAAGUGC CUGAUGAG X CGAA AAAGAGGU 

o^oy CUUUUUCA CUGAUGAG X CGAA AGUGCAAA 

6370 UCUUUUUC C UGAUGAG X CGAA AAGUGCAA 



6^a I UCCCGCUG CUGAUGAG X C(JAA AUUCUUUU 



63V4 AGGUGCUC CUGAUGAG X CGAA AGCAUCCC " 
o^UD AAAUUGU U CUGAUGAG X CGAA ACAGGUGC 
041 J GUUGAGAA CUGAUGAG X CGAA AUUGUUUA 
o^ul UUUUGAG A CUGAUGAG X CGAA AAUUGUUU 



1227 
T228" 
T229" 
7230" 



1232 



1233 



1234 



AAACCTCT T TTGCACTI 
AACCTCTT T TGCACTTf 
ACClCirj'TGCACTTTG 
illUCACTTTGAAAAAG 
ITGCACTTTGAAAAAGA 



AAAAGAA I C CAGCGGGA 



GGGATGCT C GAGCACCF 



UCACCTGT A AACAATtf 



1236 



TAAACAAT T TTCTCAAC 



AAACAAl i TTCTCAACC" 



2470 



2471 



2472 



2476 



2477 



2478 



2479 



2480 



2481 
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5 



10 



|64I4 


AGGUUGAG CUGAUGAG X CGAA AAAUUGUU 


1237 


A AC A A TTT T CTC* A A C*r*T 
/\r\\^f\J\ 111 1 U 1 L/AA^C 1 


2482 1 


6415 


UAGGUUGA CUGAUGAG X CGAA AAAAUUGU 


1238' 


Ar^A ATnT* C TC* A. AC*r*T A 


2483 


6417 


AAUAGGUU CUGAUGAG X CGAA AGAAAAUU 




A A'l"l''l"'r/^'F O A A r^r^T' a 'i* i « 

AAilLiCi LAALCIATT 


2484 


6423 


ACAUCAAA CUGAUGAG X CGAA AGGUUGAG 


1240 V 


CTCAACCT A TTTG ATGT 


2485 


6425 


G AACAUCA CUGAUGAG X CGAA AUAGGUUG 


1241 


CAACCTAT T TGATGTTC 


. 2486 


6426 


UGAACAUC CUGAUGAG X CGAA AAUAGGUU 


1242 


AACCTATT T GATGTTCA 


2487 


6432 


UUUAUUUG CUGAUGAG X CGAA ACAUCAAA 


1243 


H I OA TGT T CAAATAAA 
• 


2488 


6433 


CUUUAUUU CUGAUGAG X CGAA AACAUCAA 


1244 


TTGATGTT C AAATAAAG 


2489 


6438 


UAAUUCUU CUGAUGAG X CGAA AUUUGAAC 


1245 


GTTCAAAT A AAGAATTA 


2490 
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1 558 


GUACAUUACCUGGUA 


J.jK}y 


CUOC CGCC GCCAAC 


2622 




584 


GUACAirUACCUGGUA 




GGTC AGAC CGGCCT 


2623 


5 




ACinndA An A A nni ir* AcnAf^Ar' a a a/t* a v 
GUACAUUACCUGGUA 


251 1 


GACC GGCC TCCCCT 


2624 






ACOCClC* ACwAA (W\AC\ ACC AClAflA A AC* A v 

GUACAUUACCUGGUA 


2512 


CI AC GGCC CCGGGT 


2625 


• 




^.^VIUUvivJ Vj/VAl^ AL^l^ALrAVjAAAUA A 

GLUACAUUACCUGGUA 


2513 


G 1 1 U GGCT CCAACG 


2626 


10 




r*r*r*ncA AriAAnniir^ Af^r'AriA/^A a a/^a v" 

^\rfV^\j^A AOAA VfUUO AL^l^AOAOAAACA A 

GUACAUUACCUGGUA 


2514 


CAAC GGCC TGGGGG 


2627 




680 


AOCAiiP AfrAA nftPii AprAf;Ar;A A Ar*A v 

AVvJVrfrtUV^ rWJrvn \J\J\^\J /lv*V.*A\JA\JAAAL*A A 

GUACAUUACCUGGUA 


^315 


AGCC CGCT GATGCT 


2628 




683 


AOlIAfiP AHA A ClCClCt ACCACiAClAAAr'A V 

GUACAUUACCUGGUA 




CCGC TGAT GCTACT 


2629 




1 inociCiC* AO A A nCwi in Ar*r*Arj a/^ a a ap a v 

U\JV/WVJ^ AUAA VJVJUVJ Av*^AVJAOAAAL^A A 

GUACAUUACCUGGUA 


2517 


CACC CGCC GCCGCA 


2630 




7ft 1 


Af^r*i\fu^ ACiAA nr*nn appa/^ap a a a pa V 

Avi\^UOvy AOAA \JL.\JLr AC^^AOAvjAAALrA A 

GUACAUUACCUGGUA 


2518 


CCGC CGCC GCAGCT 


2631 


20 




ncif^ciAC AriA a nlrrici app AriAp a a apa v 

OVJ^OA^-. AOAA ij^VlVJ AL.CAUAOAAAL.A A 

GUACAUUACCUGGUA 


2519 


CCGC AGCT GTCGCC 


2632 




/ lU 


AAA fine A A A nci in a pp apapaaa^^a v 

A/\/\LHJv^ AvJAA \jL*UO AUCAUAOAAAcA A 

GUACAUUACCUGGUA 


2520 


CAGC TGTC GCCm 


2633 






PPniTfiP APA A PPAP APP AP AP A A Ar»A "V 
Cv*\jU\jVj AIjAA OL/ALr AV^v^AUAUAAAUA A 

GUACAUUACCUGGUA 


2521 


CTGC AGCC CCACGG 


2634 


25 




pprsppn An A A ncir*c appapapaaapa v 

^^k'VlVil^O AOAA OO^Lf ACv*AvjAOAAA^A A 

GUACAUUACCUGGUA 


2522 


GGCC GGCC CGCCGG 


2635 






A A I IPPP A P A A PPPP A PP APAPAAAPA V 
AkiAA UOv^l^ AL-V^AVjAoAAAC/A A 

GUACAUUACCUGGUA 


2523 


OGCC CGCC GGCATT 


2636 




1078 


PPAPPP AOAA GCiCC APPAPAPAAAPA Y 

GUACAUUACCUGGUA 


'>OA 

Z324 


GOCC TGCC GGCTCC 


2637 


30 


1082 


iiiinrnn aoaa nnrA appapahaaapa y 
GUACAUUACCUGGUA 




TGCC GGCT CCGCAA 


2638 




1206 


GUCUCC AGAA GACC ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2526 


GGTC TGCT GGAGAC 


2639 




1244 


AUCAUG AGAA GGCU ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2527 


AGCC CGCTCATGAT 


2640 




1273 


AGGCCA AGAA GUUC ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2528 


GAAC AGCC TGGCCT 


2641 




1289 


UCGGCC AGAA GGGA ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2529 


TCCC TGAC GGCCGA 


2642 
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2226 


GUACAUUACCUGGUA 


2551 


ATTC TGTC TCCTGC 


2664 1 




2301 


AC*\]A Ad AfiAA CiAC\r* ACT' An Ar* A a a/^a v 
GUACAUUACCUGGUA 


. 2552 


GCTC AGTT CTTAGT 


2665 


5 


2322 


APAriAA AriAA fZAAfZ Af^f^ AC* Ai^ A A A/^ a 'v 

Ai^AOAA AUAA UAAO ALCAuAOAAACA X 
GUACAUUACCUGGUA 


2553 


CTTC TGTC TTCTGT 


2666 1 




2329 


uuu^L.c AUAA UAAu ACCAvjAOAAACA A 
GUACAUUACCUGGUA 


2554 


CTTC TGTT GGGAAC 


2667 




2373 


A A A/^An ACv A A (W\\ \ A A CC^ AC' AC K A Af* K V 

A/VAVf AU AVJAA O U UA ALC AOAOAAACA A 
GUACAUUACCUGGUA 


2555 


TAAC AGCT CTCTTT 


2668 


10 


3420 


C\AC\\W\C* ACL A A film A A A r« A a"a a a v 

vjAUUUv^ AuAA uUuA ACCAOAGAAACA X 
GUACAUUACCUGGUA 


2556 


TCAC AGCT GAACTC 


2669 




241Q 


i^i^i^AU A AUAA UAUU ACCAOAOAAACA X 
GUACAUUACCUGGUA 


2557 


ACTC AGTC TATGGG 


2670 


1 c 




I tAClt^an AC A A nCC\ I a CC a/^a/^aaa^^* v' 

UAUUUU AUAA UUCU ACCAOAOAAACA X 
GUACAUUACCUGGUA 


2558 


AGGC AGAT CCCCTA 


2671 




C\ \d m A Af\A A nC AC a CC a/^a/^aa a r> a vf 

CUCUUA AUAA uuAu ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2559 


CTGCAGATTCAGAG 


2672 






UI-UAUA AUAA UAUC ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2560 


GCTC TGCC TCTGGC 


2673 


20 




A C" A A A A A A A C A A A A A A A A #^ « m# 

AUAAAA AUAA GG AA ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2561 


TTCCTGATTTTTGT 


2674 






I m A A n A A C A CC a A/^a^aaa/^av 

UUt^UUA AUAA UACL ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2562 


GGTC AGCT TCAGGA 


2675 






f^Ar^i irm An a a cci m a c*c^ a/^ a r> a a a/-<a v" 
CACUUU AUAA UUUC ACCAOAOAAACA X 

GUACAUUACCUGGUA 


2563 


G ACC TGTT CCAGTG 


2676 


25 


3083 


Af^ ACii irs An A A m n ir^ a cc* a/^a/~'aa a/^a v 
A^AUUU AUAA UUUC ACCAGAGAAACA X 

GUACAUUACCUGGUA 


2564 


GAAC TGTT CACTGT 


2677 




3242 


CiCJ]C ACw AfiAA fii TAT I ACnAnAnA A Ar*A V 

VJV^UI^/\VJ /VUAA UUAU AI».|^AUAUAAACA A 

GUACAUUACCUGGUA 


2565 


A7 AC AGIT CTGAGC 


2678 




3258 


CiACif^AA AflA A nnC\ 1 A CC AC. AC A A Ar* A V 
UAUv^AA AUAA UOV»,U ACCAUAUAAACA A 

GUACAUUACCUGGUA 


2566 


AGCC AGAC TTGCTC 


2679 


30 


3312 


IinTYifin AflA A npA A Ar^r*Ar'A/^A a a/^a v 

V/Uv»UUU AUAA UVrfAA ACCAUAUAAACA A 

GUACAUUACCUGGUA 


2567 


TTGC AGAC CCCGCA 


2680 




3360 


AAUAAG AGAA GGAC ACCAGAGAAACA X 
GUACAUUACCUGGUA 


. 2568 


GTCC AGCT CTTATT 


2681 




3402 


CUUGAC AGAA GCUU ACCAGAGAAACA X 
GUACAUUACCUGGUA 


. 2569 


AAGC AGCT GTCAAG 


2682 




3420 


GAACAC AGAA GUCU ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2570 


A(iAC AGCTGTGTTC 


2683 




3475 


GACAGC AGAA GUCC ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2571 


GGAC CGTT GCTGTC 


2684 
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IS 



20 



25 



30 



~UALOUA AUAA GACC AC CAUAGAAACA X 
GUACAUUACCUGGUA 



AGUGGA AGAAGUAO ACCAGAOAAACA 
GUACAUUACCUGGUA 

~C.AAAUo AGAA OGCA AC CAGAUAAAGa Ic 
GUACAUUACCUGGUA "^^^^ 



™uuAA AUAA GCCC ACCAGAGAAACAT 
GUACAUUACCUGGUA 



I GUACAUUACCUGGUA 



'^^/S^ TAGCtGCfCG GGCT 2692" 



^ 'J'^^ACA AGAA GAAA^CCAGAGAAACAX 

I GUACAUUACCUGGUA 251 IJlLlOAl lOlCCA 269r 

^ U.i.UtA AOAA OCAA ACCAGAGAAACajT 
I GUACAUUACCUGGUA 



^^^^1 ^^^^^^^^S^^AUACCAGAG}^ 

I GUACAUUACCUGGUA I 



^IJAOGC AGAA GGAU ACCAGAGAAACAX K«J 

I GUACAUUACCUGGUA AlCCAGAi ucciAT 2696" 



I AGAA GAUG AOTagagaAACA X ^ 

I GUACAUUACCUGGUA p^^nCATCAOAlWfGA^^ 



^1 ^^=^^AA<-' AGAA GCCAACCAGAGAAACAX 1 , , 

GUACAUUACCUGGUA ^ MGGCTGAI 01 lOGI 2698 



^^^^^ AGAA GAGG ALL AUAGAAACA ir--4^;5r-L-- 

I GUACAUUACCUGGUA P^^^'^ '^I^I ITCCCC ^2699 



^UCCAGAGAAGAAGACC^ L_J 

I GUACAUUACCUGGUA P^' K"^' 'GCCCTGGAGf^ 



^^^\ AACUGA AGAA OLCA ACCA(iAf^ a a AnT^ L— — 

I GUACAUuiccS^^^^^^^^J^^^ 



■^'J^UAO AUAA OAAG ACCAGAGAAACAX k.«o L L 

I GUACAUUACCUGGUA -!i»9 ™AG11l1AGAG 2702" 



'^ULAAA AGAA GLAL ALLAUAGAAAC A Y k, J 

GUACAUUACCU^UA P^" N'^<- AGlCTrrGArhTOr 



"^^n AGAAWGUAC LAGAOAAACA y h^T^ . 

GUACAUUACCUGGUA^^^^ T^^^nACArfGAmcA^^ 

^ ^'^^ AGAA UCCU ACCAGAGAAACAX , „, 

I GUACAUUACaJGGJJA^^^^^^ AGGC AGAT CTGCTrHTOS 
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10 



20 



15595 


UCCCCA AGAA GAUC ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2593 


GATC TGCT TGGGGA 




5803 


GUUGGA AGAA G AGC ACCAGAGAAACA X 
GUACAUUACCUGGUA 


V 2594 


GCTC TfiTT TCC A A p 


J. iXSt 


5886 


AAUAGA AGAA GCAA ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2595 


I'KiC TGCC TPTATT 




5916 


UCAAAA AGAA GUGC ACCAGAGAAACA X 

GUACAUUACCUGGUA 


2596 


GCAC TGTC TTTTfiA 


17AO 


6087 


AAAGGG AGAA GUGU ACCAGAGAAACA X 
GUACAUUACCUGGUA . 


2597 


APAP AHAP PPPTTT 


271U 


6154 


AACAGA AGAA GGCA ACCAGAGAAACA X 
GUACAUUACCUGGUA 


"2598 


TfiPP AnTTTPTriTX 
1 U^V^ /IVJ 11 l^lljll 


11 \ 1 


6160 


UGAGAG AGAA GAAA ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2599 


TTTr lY^TT PTPTPA 




6284 


GUAUGC AGAA GCAA ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2600 


TTGC CG AT GCATAC 


2713 


6300 


AGUCAC AGAA GUAA ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2601 


TTACTGATGTGACT 


2714 


6311 


CGACAA AGAA GAGU ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2602 


ACTC GGTT TTGTCG 


2715 


6322 


AAGCAA AGAA GCG A ACCAGAGAAACA X 
GUACAUUACCUGGUA 


2603 


TCGC AGCT TTGCTT 


2716 



25 



30 
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Table VII. Antisense Nucleic Acid Molecules Targeting c-raf 



5 



10 



15 



20 



Seq. 
I.D. No. 


Sequence 


2731 


CsgsasauugCsAsTsCsCsTsGsAsAsacagsasasA 


2732 


gsUsasCCtgAsTsTsC^GsCsTsGsTsgacUsUsCsG 


2733 


gsCga^ccagCaAsCaAaGsAsCsTsTsaccUsgsasT 


2734 


UsasgsCagcCsCsTsGsAsGsCsCsTsUacCsUegsG 


2735 


CsasgsgaucTsGsAgAsAsCsAsAsAsgccCsasasG 


2736 


UsgsCsCaucTsTsTsA^C^GsA^AsCsCaaCsCsCsA 


2737 


gsUsgsgtcaGsCTsGsTsGsCsAsAsGsCauUsgsasT 


2738 


TsCsCsCsGsCsCsTsGsTsGsAsCgAsTsGsCsAsTsTs 


2739 


T3C3C3G3CSC3G3T3C3T5C3A3GSA3TSCSG3A3TST3 


2740 


agucccgCsCsTsGsTsGsAsCsAsUgcauuc 


2741 


asgsUsCccgCsCsTsGsTsGsAsCsAsUgcausUsCs 




asgaU3CCcgC9C3T3G3TsG3A3CsAsUgcaUsUsCs 


2743 


iB agucccgCsCsTsGsTsGsAsCsAsUgcauuc iB 


2744 


agucccgCsCsTsGsTsGsAsCsAsugcauuc iB 


2745 


gauccgcCsGsTsCsTsCsAsGsAsUcgaucu 


2746 


gsasUsCcgcCsGsTsCsTsCsAsGsAsUcgasUgCgU 


2747 


iB gauccgcCsGsTsCsTsCsAsGsAsUcgaucu iB 


2748 


gaueegcCaGsTsCsTsCsAsGsAsUcgaucu iB 



lower case = 2'-0-methyl nucleotides; UPPER Case = DNA; s = phosphorothioate linkage; 
in= 5 methyl C;iB=inverted abasic;boId lower case=2'-0-methylthiomethyi modified 
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10 



15 



20 



25 



30 



35 



Table Vra. AHtisense Nucleic Acid Molecules Targeting BcW and K-ra. 



lower 




wer case = 2^0-methyl nucleotides; UPPtR Case - DNJ a • e - u r 
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. Claims 

1 . A nucleic acid molecule having the fomiula I: 
wherein each of X represents independently a nucleotide which may be same or different; 

5' C (X)fir-(Y) — (X) — C 3- 

5 where m and o are integers independently greater than or equal ta 5; (X)n, and (X)o are 
oligonucleotides which are of sufficient length to stably interact independently with a 
target nucleic acid molecule; Y represents mdependently a deoxyribonucleotide which 
may be same or different; (¥)„ is an oligonucleotide which is of sufficient length to stably 
interact independently with a target nucleic acid molecule; n is an integer greater than or 

10 equal to 4; represents a chemical linkage; each (X)m, and (X)o comprise independently 

at least one phosphodiester linkage and one phosphorothioate linkage; (¥)„ comprises a 
phosphorothioate linkage or a phosphorodithioate linkage or a 5'-thiophosphate linkage or 
a mixture thereof; and each C and C independently represents a cap structure which may 
independently be present or absent. 

15 2. A nucleic acid molecule having the formula II: 

5' C— (Y)n (Xh^C 3' 

wherein X represents a nucleotide which may be same or different; where r is an integer 
greater than or equal to 4; (X)r is an oligonucleotide which is of sufficient length to stably 
interact independently with a target nucleic acid molecule; Y represents independently a 

20 deoxyribonucleotide which may be same or different; n is an integer greater than or equal 
to 4; (Y)n is an oligonucleotide which is of sufficient length to stably interact 

independently with a target nucleic acid molecule represents a chemical linkage; 

(Y)n comprises a phosphorothioate Imkage or a phosphorodithioate linkage or a 5'- 
thiophosphate linkage or a mixture thereof; each (X)r.comprises independently at least one 

25 phosphodiester linkage and one phosphorothioate linkage; and each C and C 
independently represents a cap structure which may independently be present or absent. 

3. A nucleic acid molecule having the formula III: 
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10 



5- C—{X}f ^^(Y)„ c 3" 

«*«ei„ X represents a n.cleoMe ^ch may be same o, diffcem; where r is an integer 

/v\ • , muict^uie. ^ 'presents a chemical linlcaaf*- 

Oa <»mp„ses a phosphoro4i<..e linkage or a phospho,«,iU,„a« linkage or a mi^^; 
^Phoro, ,„a« ^ ph„sph„™di«„a,e Unka.es; each (X> compriseslde;:.^ 
leas, one phosphod,es,er linkage one phcphoroAioate lidcage; aad eaerH^C 
.«..pe.^e.,,,.prese«sacaps«^„..„M0Kmayi„depe„de„,,Lp,esen7^^ 

fonnJlV: ^ ™° """'""'^ ""^^ ""ivfy °' 

5' C~(X)— (Z)p— (X)„-(Y)ri— (X)-c- 3- 



15 vAerein each of X represent i-Klependently a nucleodde which may be same or diffen=„.. 
where m. o and q are inr^ers indepe«le«iy greater or equal ,„ 5- (X) and x 
ohgor-ncleo^des Which are of sofflciem ,e„g«, . 

2tl acrd molecnle; (X), is oprional.y able . inierac. „i.h a .:ge. nliT*: 

20 Y represents independently a deoxyribontKleotid. which may be .same or differ^,- (Y, is 
gonucleottde which is of snfflcien. ,e„g«. to stably interact i™iepe„de.,ly 

ta^etnucle,cac,dmolecule;„isanin.e8ergreaterthan„requal,„4- reorlT 
ctanica, linkage; each (X,„, m. and (X), comprise indep^d^;;!, 7Z „„ 

25 n^Tr 'T''- ' PhosphorotJoate L^ge or 

25 phosphonxhthroa^ linkage or a S'-thiophosphate linkage or a tnixture thereof Z 
™ an ohgonucleotide able to Militate the cleavage of the target „ ^ acid 
molecule; p ,s of length gteater than or e,M, to 4; and each C and C ind^L: 
n=presents a cap str^cmre which may independently be present or absent 
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5. An enzymatic nucleic acid molecule having endonuclease activity of the 
formula V: „ 

5" C (Xk-(Y)„-(X)o-(Z)p-{X)— C 3' 

wherein each of X represents independently a nucleotide which may be same or different; 
5 where m, o and q are integers independently greater than or equal to .5; (X)n, and (X)o are 
oligonucleotides which are of sufficient length to stably interact independently with a 
target nucleic acid molecule; (X)q is optionally able to interact with the target nucleic acid 
molecule; (Y)„ represents independently a deoxyribonucleotide v/hich may be same or 
different; (Y)n is an oligonucleotide which is of sufficient length to stably interact 

10 independently with a target nucleic acid molecule; n is an integer greater than or equal to 

4; represents a chemical linkage; each (X)m, (X)o and (X)q comprise independently at 

least one phosphodiester linkage; (¥)„ comprises a phosphorothioate linkage or a 
phosphorodithioate linkage or a 5'-thiophosphate linkage or a mixture thereof; Z 
represents an oligonucleotide able to facilitate the cleavage of the target nucleic acid 

15 molecule; p is of length greater than or equal to 4; and each C and C independently 
represents a cap structure which may independently be present or absent, 

6. An enzymatic nucleic acid molecule having endonuclease activity of the 
formula VI: 

5" C (Y)n (Z)p (X)q— C 3" 

20 wherein X represents independently a nucleotide which may be same or different; where q 
is an integer independently greater than or equal to 1; (X)q is optionally able to interact 
with a target nucleic acid molecule; Y represents independently a deoxyribonucleotide 
which may be same or different; (¥)„ is an oligonucleotide which is of sufficient length to 
stably interact independently with a target nucleic acid molecule; n is an integer greater 

25 than or equal to 4; represents a chemical linkage; (¥)„ comprises a phosphorothioate 

linkage or a phosphorodithioate linkage or a 5'-thiophosphate linkage or a mixture thereof; 
Z represents an oligonucleotide able to facilitate the cleavage of the target nucleic acid 



wo 99/54459 



PCTAJS99/08547 



110 



molecule; P « of lengft g,eater or e,^ to 4; a«l each C C iMiependendv 

5' C (X)q (2)p (Y)„ C- 3' 

wherein X represents independently a nucleotide which may be same or 4. 
is an integer independently gteater than or cttal to 1 ■ m ^ T " 

with a faro^* „. ^1 j . 4"'" lo J , (A^, IS optioDally aWc to interact 

^vitfi a target nucleic ac.d molecule; Y represents independently a deoxyribonucleotide 

phosphorodithioate linkage or a 3' «? nh«.«k ,• , 

«K„ u J-,. J -!>-phosphorothioate linkage or a I'-S 

.5 ' '^'^ ^ - o^onuceotidc ab e 

equal to 4 ami each C and C independentfy represems a cap s,nK*« which mTy 
"Kiependently be present or absent. ^"ire wmcn may 

8 The nucleic acid molecule of any of claims 1-3, wherein each X 

"idependently comprises a nucleotide modification selected ^""^ ^ 

20 r-O-methyl 2'.Oallvl Tn ,^ , , °° '""^"='"^°"' 8™"? "W'Ming of: 
2 -O-allyl, 2 -0-methylthiomelhyl, L-nucleotides; 2'-(--allvl- 1-5 

An^d^he^l; 2.6-diaminop™,e; 2-.fiuo..; J'^eoxy-a.-amino; T^N-^^^'J^. 

IZ^y^- ^■^■W-'anyl) amino; 2..deoxy-2-- lysyl) at^irT' 

O^tno; -D»xy-2--(Ar-his.id,., amino; 6-me.hyl uridine; 5-methyl c>I,I e 2' W- 
25 l=^'7'''7;'^-^'>™»°-2.-d«.xy.n„cleotide; T-O-me Jthi^yl; 

rr^r' "--'^ r-«.-.hiophoite ^:d 

both X and Z, mdepcndcntly comprises a nucleotide modification selected ih,m ,1,, 
group consisting of 2'-0-methvl T O all„l o- n .u , ,.. 
30 all»l- I s A„., J , , ^ "-""y'- 2 -0-methylthiomethyl, L-nuclcotides; 2--C- 

aUyl, l-5.Anhyd,ohexy,ol; 2,6-diami„opurine; 2--fl„„ro; 2-.deoxy-2--amino; 2-.H; 2M^- 
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alanyl) amino; 2'-(A/-phenylalanyl)amino; 2'-deoxy-2'-(iV-p.alanyl) amino; 2'-deoxy-2'. 
(lysyl) amino; 2'.0-amino; 2'-Deoxy-2'-(//-histidyl) amino; 6-methyl uridine; 5-methyl 
cytidine; 2'-(//-P-carboxamidine-P-aIanyl)amino-2'-deoxy-nucieotide; 2'-0- 

methylthioailyl; 2*-0-methylthioethyl; 2'-0-methylthiomethyl; 2'-0-methyl-3'- 
5 thiophosphate and xylofiiranosyl. 

10. The enzymatic nucleic acid molecule of any of claims 4-7, wherein, the Z 
in said nucleic acid molecule is the catalytic core. 

11. The enzymatic nucleic acid molecule of any of claims 4-7, wherein said 
enzymatic nucleic acid is in a hammerhead ribozyme configuration. 

10 12. The enzymatic nucleic acid molecule of any of claims 4-7, wherein said 

enzymatic nucleic acid is in a hairpin ribozyme configuration. 

13. The enzymatic nucleic acid of any of claims 4-7, wherein said enzymatic 
nucleic acid is in a hepatitis delta virus, group I intron, VS RNA, group II intron, or RNase 
P RNA configuration. 

15 1 4. A method of cleaving an RNA molecule comprising the step of, contacting 

the enzymatic nucleic acid molecule of any of claims 4-7, with the RNA molecule under 
conditions suitable for the cleavage of said RNA molecule by said enzymatic nucleic acid 
molecule. 

15. The method of claim 14, wherein said cleavage is carried out in the 
20 presence of a divalent cation. 

1 6. The method of claim 1 5, wherein said divalent cation is Mg^"^. 

17. The nucleic acid molecules of any of claims 1-3, wherein said C, when 
present, is a cap selected fi-om the group consisting of: inverted abasic residue; 4',5*- 
methylene nucleotide; l-(beta-D-erythrofuranosyl) nucleotide; 4'-thio nucleotide, 

25 carbocyclic nucleotide; 1,5-anhydrohexitol nucleotide; L-nucleotides; alpha-nucleotides; 
modified base nucleotide; phosphorodithioatc linkage; //ireo-pentofiiranosyl nucleotide; 
acyclic 3*,4*-seco nucleotide; acyclic 3,4-dihydroxybutyl nucleotide; acyclic 3,5- 
dihydroxypentyl nucleotide, 3'-3 -inverted nucleotide moiety; 3*-3'-inverted abasic moiety; 



wo 99/54459 



PCT/US99/08547 



112 



3'-2'-inverted nucleotide moiety; 3'-2Wed abasic moiety; 1,4-butan.^io, phosphate; 
3 -phosphor hexylphosphate; aminohexyl phosphate; 3--phosphate; 3-. 
phosphorothioate; phosphorodithioate; and methylphosphonate moiety. 

18. The nucleic acid molecule of any of claim 1-3. wherein said nucleic acid 
5 molecule compnses a 3'-3' linked inverted ribose moiety at said 3' end. 

19. The nucleic acid molecule of any of claims 1-3, whergin said nucleic acid 
molecule is an antisense nucleic acid molecule. 

I, , . sf ■ . T""" '''^'^^ ^^^^'^ '^"'^l^i^ molecule 

IS a 2-5 A antisense chimera. 



10 



21. The nucleic acid molecule of any of claims 1-3 wherein said nucleic acid 
molecule is a tnplex forming oUgonuclcotide. 



22. 

Nos. 1-30. 



A nucleic acid molecule comprising any of sequence defmed as Seq. ID 

15 claimsZ? i"«='"ding an enzymatic nucleic acid molecule of any of 

24. The mammalian cell of claim 23. wherein said mammalian cell is a human 



cell. 



25. 



A method of modulating the expression of a gene in a cell comprising the 
step of admmistenng to said cell a nucleic acid molecule of any of claims 1-3 under 
20 conditions smtable for the down regulation of said gene. 



26. A method of modulating the expression of a gene in a cell comprising the 
step of admimstering to said cell an enzymatic nucleic acid molecule of any of claims 4-7 
under conditions suitable for the down regulation of said 



gene. 



27. The enzymatic nucleic acid molecule of any of claims 4-7. wherein said C 
when present, is a cap selected from the group consisting of: inverted abasic residue- 4' 5'- 
methylene nucleotide; l-(beta-D-erythroluranosyi) nucleotide; 4--thio nucleotide 
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carbocyclic nucleotide; 1,5-axihydrohexitol nucleotide; L-nucleotides; alpha-nucleotides; 
modified base nucleotide; phosphorodithioate linkage; /Areo-pentofiiranosyl nucleotide; 
acyclic 3*,4'-seco nucleotide; acyclic 3,4-dihydroxybutyl nucleotide; acyclic 3,5- 
dihydroxypentyl nucleotide, 3 -3'-inverted nucleotide moiety; 3*-3 -inverted abasic moiety; 
5 3-2'-inverted nucleotide moiety; 3'-2'-inverted abasic moiety; 1,4-butanediol phosphate; 
3*-phosphoramidate; hexylphosphate; aminohexyl phosphate; 3-phosphate; 3'- 
phosphorothioate; phosphorodithioate; and methylphosphonate moiety. 

28. The enzymatic nucleic acid molecule of any of claims 4-7, wherein said C, 
when present, is a cap selected from the group consisting of: 4^5'-methylene nucleotide; 1- 

10 (beta-D-erythrofuranosyl) nucleotide; 4*-thio nucleotide, carbocyclic nucleotide; 5*-amino- 
alkyl phosphate; l,3-diamino-2-propyl phosphate, 3-aminopropyl phosphate; 6- 
aminohexyl phosphate; 1,2-amuiododecyl phosphate; hydroxypropyl phosphate; 1,5- 
anhydrohexitol nucleotide; L-nucleotide; alpha-nucleotide; modified base nucleotide; 
phosphorodithioate; /Areo-pentofiiranosyl nucleotide; acyclic 3',4'-seco nucleotide; 3,4- 

15 dihydroxybutyl nucleotide; 3,5-dihydroxypentyl nucleotide, 5-5*-inverted nucleotide 
moeity; 5-5'-invcrted abasic moeity; 5-phosphoramidate; 5'-phosphorothioate; 1,4- 
butanediol phosphate; 5'-aniino; bridging and/or non-bridguig 5-phosphoramidate, 
phosphorothioate and/or phosphorodithioate, bridging or non bridging methylphosphonate 
and 5 -mercapto moeities. 

20 29. The nucleic acid molecules of any of claims 1-3, wherein said C, when 

present, is a cap selected from the group consisting of: 4',5'-methylene nucleotide; l-(beta' 
D-erythrofliranosyl) nucleotide; 4'-thio nucleotide, carbocyclic nucleotide; 5*-amino-alkyl 
phosphate; l,3-diamino-2-propyl phosphate, 3-aminopropyl phosphate; 6-aminohexyl 
phosphate; 1,2-aminododecyl phosphate; hydroxypropyl phosphate; 1,5-anhydrohexitol 

25 nucleotide; L-nucleotide; alpha-nucleotide; modified base nucleotide; phosphorodithioate; 
r/ireo-pentofuranosyl nucleotide; acyclic 3',4-seco nucleotide; 3,4-dihydroxybutyl 
nucleotide; 3,5-dihydroxypentyl nucleotide, 5'-5 -inverted nucleotide moeity; 5'-5'-inverted 
abasic moeity; 5-phosphoramidate; 5 -phosphorothioate; 1,4-butanediol phosphate; 5'- 
amino; bridging and/or non-bridging 5-phosphoramidate, phosphorothioate and/or 

30 phosphorodithioate, bridging or non bridging methylphosphonate and 5'-mercapto 
moieties. 



30. 



A manunalian cell including a nucleic acid molecule of any of claims 1-3. 
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cell. 



31. Tlxe mamnuUian celJ of cJaun 30, whe.i„ said .a.n,alian ce„ is a hun,an 



32. The enzymatic nucleic acid molecule of any of claim. 4.7 , • • 
enzymatic nucleic acid molecule comprises a 3' 3' H„i..h ! ' ^^"^"^ '"^^ 

5 end. ^ ^"^^ ribose moiety at said 3' 

33. The nucleic acid molecule of anv of HaJm« 1 1 u • 

are symmetric in length. * ^'^ (X)o 

34. ITie nucleic acid molecule of any of claims 1 1 „,K • 

are asymmetric in length. ' and (X)„ 

nucleic acid molecul*. nf »n„ I • 

36. The enzymatic nucleic acid molecule of ««« i • 
(X). and (X), are a^»«ttc in l=„gd,. ' °' 

n and ^ „f „, 

39. The enzymatic nucleic acid molecule of claim 4 wherein fh.c. , . 
n and q is greater than said m. '"^ °^ 

20 

40. Tl"^ 'nzymalic nucleic acid molecule of claim 5 whoW.,.1, , . 
n and m is greater than said q. ' '™ 

»and;s\ess^Z:''""^'''^^"°''='''"*-"'«-'"'^-of^aido, 
25 . and m'Le.s^X" - "f-i- 
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43. An enzymatic nucleic acid molecule with RNA cleaving activity, wherein 
said enzymatic nucleic acid molecule modulates the expression of an estrogen receptor 
gene. 

44. The enzymatic nucleic acid molecule of claim 43, wherein said enzymatic 
5 nucleic acid molecule is in a hammerhead configuration. 

^ 45. ITie enzymatic nucleic acid molecule of claim 44, wherein said enzymatic 
nucleic acid molecule comprises a stem II region of length greater than or equal to 2 base 
pairs. 

46. The enzymatic nucleic acid molecule of claim 43, wherein said enzymatic * 
10 nucleic acid molecule is in a hairpin configuration. 

47. The enzymatic nucleic acid molecule of claim 43, wherein said enzymatic 
nucleic acid is in a hepatitis d virus, group I intron, group II intron, VS nucleic acid or 
RNase P nucleic acid configuration. 

48. The enzymatic nucleic acid of claim 46, wherein said enzymatic nucleic 
1 5 acid molecule comprises a stem II region of length between three and seven base-pairs. 

49. The enzymatic nucleic acid molecule of claim 43, wherein said nucleic acid 
comprises between 12 and 100 bases complementary to said RNA. 

50. The enzymatic nucleic acid molecule of claim 43, wherein said nucleic acid 
comprises between 14 and 24 bases complementary to said mRNA. 

20 51. The enzymatic nucleic acid molecule of claim 44, wherein said enzymatic 

nucleic acid molecule consists essentially of any sequence defined as Seq ID Nos 1-1245. 

52. A mammalian cell mcluding an enzymatic nucleic acid molecule of any of 
claim 43. 



53. 

25 cell. 



The mammalian cell of claim 52, wherein said mammalian cell is a human 
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54. An 



5 

cell. 



An expression vector comprising nucleic aciH 
one enzymatic nucleic acid molecule of clL 4^ '"""^'"^ 
thatenzymaticnucleicacidrlcl 

55. A mammalian cell including an expression vector of claim 54. 

56. 7.e mammalian cell of claim 13. wherein said mammalian cell is a human 



Patient:-en.r:r::~r^^^^^ 
10 PatienttL.p:r„tltlT 

43;.dc)mtroducingsaidcell!:a::::^^^^^ 
15 molecultlclaL^r^"'^^ ™c nucleic acid 

level oflogen ^^^"^ ^ associated with the 

molecule of cL 4^ " '""'"^^^^^^^ -^-tic nucleic acid 



20 level ofLgen rri^co^ ' '^""^ ^ ---^cd with the 
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64. The eniymatic nucleic acid of claim 63, wherein said nucleic acid 
comprises a 3-3' Imked inverted ribose moiety at said 3' end. 

65. The enzymatic nucleic acid molecule of claim 44, wherein said nucleic acid 
molecule comprises at least five ribose residues, and wherein said nucleic acid molecule 

5 comprises phosphorothioate linkages at at least three of the 5' terminal nucleotides, and 
wherein said nucleic acid comprises a 2'-amino modification at position No. 4 and/or at 
position No. 7 of said nucleic acid molecule, wherein said nucleic acid, molecule 
comprises at least ten 2'-0-methyl modifications, and wherein said nucleic acid comprises 
a 3 ' - end modification. 

1 9 Th® enzymatic nucleic acid molecule of claim 44, wherein said nucleic acid 
molecule comprises at least five ribose residues, and wherein said nucleic acid molecule 
comprises phosphorothioate linkages at at least three of the 5* terminal nucleotides, and 
wherein said nucleic acid molecule comprises an abasic substitution at position No. 4 
and/or at position No. 7 of said nucleic acid molecule, wherein said nucleic acid comprises 

1 5 at least ten 2 -0-methyl modifications, and wherein said nucleic acid molecule comprises a 
3*-end modification. 

67. The enzymatic nucleic acid molecule of claim 44, wherein said nucleic acid 
molecule comprises of at least five ribose residues, and wherein said nucleic acid 
comprises phosphorothioate linkages at at least three of the 5' terminal nucleotides, and 

20 wherein said nucleic acid molecule comprises a 6-methyl uridine substitution at position 
No. 4 and/or at position No. 7 of said nucleic acid molecule, wherein said nucleic acid 
molecule comprises at least ten 2'-0-methyl modifications, and wherein said nucleic acid 
molecule comprises a 3* end modification. 

68. A method for modulating expression of estrogen receptor gene in a 
25 manmialian cell by administering to said cell the enzymatic nucleic acid molecule of claim 

43. 

69. A method of cleaving a separate RNA molecule comprising, contacting the 
enzymatic nucleic acid molecule of claim 43 with^ said separate RNA molecule under 
conditions suitable for the cleavage of said separate RNA molecule. 
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70. The method of claim 69 wherein 

presence of a divalent cation. " ^"^^ « 

71. The method of claim 70. wherein said divalent cation is Mg2+ 

5 chemic^tsynt^!:^'' ^^-^^ -'^ic acid is 

73. The expression vector of claim 54, wherein said vector comprises- 
a; a transcnption initiation region; 
b) a transcription temination region; 
^ c) a gene encoding at least one said nucleic acid molecule; and 

74. The expression vector of claim 54. wherein said vector comprises- 

a) a transcnption initiation region; 

b) a transcription termination region; 

c) an open reading frame- 

75. The expression vector of claim 54, wherein said vector comprises: 

a) a transcription initiation region; 

b) a transcription termination region; 

c) an intron; 

d) a gene encoding at least one said nucleic acid molecule; and 



76. 



Tlie expression vector of claim 54, wherdn said vector comprises: 
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a) a transcription initiation region; 

b) a transcription termination region; 

c) an intron; 

d) an open reading frame; 

5 e) a gene encoding at least one said nucleic acid molecule, wherein 

said gene is operably linked to the 3'-cnd of said open reading frame; and 

wherein said gene is operably linked to said initiation region, said intron, 
said open reading frame and said termination region, in a manner which allows expression 
and/or delivery of said nucleic acid molecule. 

10 77. The enzymatic nucleic acid molecule of claim 44, wherein said enzymatic 

nucleic acid comprises sequences that are complementary to any of sequences defined as 
SeqIDNos 1246-2490, 

78. The enzymatic nucleic acid molecule of claim 46, wherein said enzymatic 
nucleic acid molecule consists essentially of any sequence defined as Seq ED Nos 2491- 

15 2603. 

79. The enzymatic nucleic acid molecule of claim 46, wherein said enzymatic 
nucleic acid comprises sequences that are complementary to any of sequences defined as 
Seq ID Nos 2604-2716.. 

80. The enzymatic nucleic acid molecule of claim 43, wherein said enzymatic 
20 nucleic acid is a DNA enzyme. 

81. The enzymatic nucleic acid molecule of claim 43, wherein said enzymatic 
nucleic acid comprises at least one 2'-sugar modification. 

82. The enzymatic nucleic acid molecule of claim 43, wherein said enzymatic 
nucleic acid comprises at least one nucleic acid base modification. 

25 83. The enzymatic nucleic acid molecule of claim 43, wherein said enzymatic 

nucleic acid comprises at least one phosphorothioate modification. 
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84. The 



enzymatic nucleic acid molecule of any of claim. 4 7 u 
enzymatic nucleic acid molecule comprises a 3--3' linked inveldT : 
said 3' end. '"^^^ted deoxynbose moiety at 

85. 'The nucleic acid molecule of any of claim* 1 ^ u ■ 

86. The nucleic acid molecule of any of claims 1 ^ «a • 

10 said 5- end. '"^^^ '"^^^^'^ deoxyribose moiety at 

88. The nucleic acid molecule of any of claim, n u ■ 




OR, 

IX 



group or a phosphorus-containing group. Penaemiy an O, blocking 



89. The nucleic acid molecule of any of claim, i i u • 
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E 



Wherein, each B is independently a modified or an unmodified nucleic acid base; Rl is 
independently an aklyl, an alkylthioalkyl, a fluoroalkyl or an alkylthiofluoroalkyl; E is 
5 independently a phosphorus-containing group; and D is independently an O, blocking 
group or a phosphorus-containing group. 

90. The nucleic acid molecule of any of claims 4-7, wherein each X, Z or both 
X and Z, independently comprise a nucleotide modification having formula X: 




10 

Wherein, each B is independently a modified or an unmodified nucleic acid base; Rl is 
independently an aklyl, an alkylthioalkyl, a fluoroalkyl or an alkylthiofluoroalkyl; E is 
independently a phosphorus-containing group; and D is independently an O, blocking 
group or a phosphorus-containing group. 



15 



91. The nucleic acid molecule of any of claims 4-7, wherein each X, Z or both 
X and Z, independently comprises a nucleotide modification having fomiula IX: 
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D. 




E 



OR, 

IX 



92. The nucleic acid molecule of any of claims \ -i «r,A a n ^ 
comprises a phosphorothioate linkage. ^^"'^^ (^)« 

93. The nucleic acid molecule of any of claim*: n ^ - 
comprises a phosphorodithioate linkage. ' '''' ^^''""^ 

94. The nucleic acid molecule of any of claims n »nH a -7 u • 
comprises a 5'-S-phosphorothioate linkage. <Y>" 

95. The nucleic acid molecule of any of claims 1 ^ -7 . 
co»si«sofph„.ph„„«,„3.ea„l^e«eve,yp«ta ' """"" 

96. The nucleic acid molecule of any of claim<! ] a n ^ 

97. The nucleic acid molecule of any of claims ) i .^a a n . 

98. Tie nucleic acid molecule of any of claims 1-3 and 4-7 wherein m 
oompnses a combinaUo. of pho.pl,o,o«.„a.e and phosphorodi,hioate li^l^eT 

99. The nucleic acid molecule of any of claims 11 =,„H ^ -7 u • 
comp..esacom.ina.o„ofpHospho,o,«oa.a„d;CoX.te;^^^^^ 
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100. The nucleic acid molecule of any of claims 1-3 and 4-7, wherein (¥)„ 
comprises a combination of phosphorodithioate and 5'-S-phosphorothioate linkages. 

101. The nucleic acid molecule of any of claims 1-3 and 4-7, wherein (Y)n 
comprises a combination of phosphorothioate, phosphorodithioate and 5'-S- 
phosphorothioate linkages. 

102. The nucleic acid molecule of any of claims 1-3 and 4-7, wherein (¥)„ 
comprises a combination of phosphorothioate and 5'-S-phosphorodithioate linkages. 

103. The nucleic acid molecule of any of claims 1-3 and 4-7, wherein (Y)n 
comprises a combination of phosphorodithioate and 5'-S-phosphorodithioate linkages. 

104. The nucleic acid molecule of any of claims 1-3 and 4-7, wherein (¥)„ 
comprises a combination of phosphorothioate and 3'-S-phosphorothioate linkages. 

105. The nucleic acid molecule of any of claims 1-3 and 4-7, wherein (Y)n 
comprises a combination of phosphorodithioate and 3'-S-phosphorothioate linkages. 

106. The nucleic acid molecule of any of claims 1-3 and 4-7, wherein (Y)n 
comprises a combination of phosphorothioate and 3'-S-phosphorodithioate linkages. 

107. The nucleic acid molecule of any of claims 1-3 and 4-7, wherein (Y)n 
comprises a combination of phosphorodithioate and 3'-S-phosphorodithioate linkages. 
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